数据帧R的列的离群值
我试图删除r中数据集特定列的异常值(替换为NA):数据帧R的列的离群值,r,replace,null,outliers,quantile,R,Replace,Null,Outliers,Quantile,我试图删除r中数据集特定列的异常值(替换为NA): dat2 <- read.csv("~/Rutas Grises/dat2.csv") 定义阈值:f问题在于ifelse() 最终结果将是2种可能性中的1种,您应该通过传递给变量,我们不知道dat2$pct_desti_unicos_sms看起来您的语法对于ifelse是错误的。它应该是pct\u desti\u unicos\u sms f,NA,dat2$pct\u desti\u unicos\u sms) Min. 1
dat2 <- read.csv("~/Rutas Grises/dat2.csv")
定义阈值:
f问题在于ifelse()
最终结果将是2种可能性中的1种,您应该通过传递给变量,我们不知道dat2$pct_desti_unicos_sms
看起来您的语法对于ifelse
是错误的。它应该是pct\u desti\u unicos\u sms f,NA,dat2$pct\u desti\u unicos\u sms)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
0.2632 0.5587 1.0470 8.4050 2.5640 100.0000 673
ifelse(test = !is.na(dat2$pct_desti_unicos_sms) & dat2$pct_desti_unicos_sms > f,
yes = pct_desti_unicos_sms <- NA,
no = pct_desti_unicos_sms <- dat2$pct_desti_unicos_sms)
dat2$pct_desti_unicos_sms<-as.numeric(dat2$pct_desti_unicos_sms)
summary(dat2$pct_desti_unicos_sms)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
NA NA NA NaN NA NA 1911
dat2$pct_desti_unicos_sms <- ifelse(
test = !is.na(dat2$pct_desti_unicos_sms) & dat2$pct_desti_unicos_sms > f,
yes = NA,
no = dat2$pct_desti_unicos_sms)
ifelse(test = !is.na(dat2$pct_desti_unicos_sms) & dat2$pct_desti_unicos_sms > f,
yes = dat2$pct_desti_unicos_sms <- NA,
no = dat2$pct_desti_unicos_sms <- dat2$pct_desti_unicos_sms)
if(!is.na(dat2$pct_desti_unicos_sms) & dat2$pct_desti_unicos_sms > f) {
dat2$pct_desti_unicos_sms <- NA
}