在R中,有没有一种方法可以将缺失观测值的不同模式合并到列中?
我有一些我想加在一起的变量,但其中有些变量缺少观测值,当加在一起时,会使整行缺少一个或多个变量。例如,假设我的期望值是最后一列在R中,有没有一种方法可以将缺失观测值的不同模式合并到列中?,r,sum,summary,rowsum,R,Sum,Summary,Rowsum,我有一些我想加在一起的变量,但其中有些变量缺少观测值,当加在一起时,会使整行缺少一个或多个变量。例如,假设我的期望值是最后一列 df <- matrix(c(23, NA, 56, NA, NA, 43, 67, NA, 11, 10, 18, 39), byrow = T, nrow = 3) colnames(df)<- c("X", "y", "z", "sum") df X y z sum [1,] 23 NA 56 NA [2,] NA
df <- matrix(c(23, NA, 56, NA, NA, 43, 67, NA, 11, 10, 18, 39), byrow = T, nrow = 3)
colnames(df)<- c("X", "y", "z", "sum")
df
X y z sum
[1,] 23 NA 56 NA
[2,] NA 43 67 NA
[3,] 11 10 18 39
Here is my expectation
df2 <- matrix(c(23, NA, 56, 79,
NA, 43, 67, 110,
11, 10, 18, 39), byrow = T, nrow = 3)
colnames(df2)<- c("X", "Y", "Z", "sum")
df2
X Y Z sum
[1,] 23 NA 56 79
[2,] NA 43 67 110
[3,] 11 10 18 39
How can I get this result?
I am using R version 3.6 on Window 10.
df正如Ben所指出的,我认为你想要的是na.rm=TRUE
,所以类似这样:
df <- matrix(c(23, NA, 56, NA, 43, 67, 11, 10, 18), byrow = T, nrow = 3)
colnames(df)<- c("X", "y", "z")
cbind(df, summ = rowSums(df, na.rm = TRUE))
# X y z summ
# [1,] 23 NA 56 79
# [2,] NA 43 67 110
# [3,] 11 10 18 39
df您使用什么代码对每行进行求和-rowSums
?如果是,您是否包括na.rm=TRUE
?
library(dplyr)
df_frame <- data.frame(df)
df_frame <- df_frame %>%
mutate(summ = rowSums(., na.rm = TRUE))
df_frame
# X y z summ
# 1 23 NA 56 79
# 2 NA 43 67 110
# 3 11 10 18 39
#OR this if you just want to select numeric variables from the dataframe:
df_frame <- data.frame(df)
df_frame <- df_frame %>%
mutate(summ = rowSums(select_if(., is.numeric), na.rm = TRUE))
df_frame
# X y z summ
# 1 23 NA 56 79
# 2 NA 43 67 110
# 3 11 10 18 39