First time using R: how to compare two columns


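I have a dataset "world" in which the column "women09" contains percentage values and the column "pr_sys" contains "Yes" or "No". I need to find the percentage in "women09" for "Yes" and for "No". I don't know how to compare these two columns.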

women09 = world[["women09"]]
pr_sys = world[["pr_sys"]]
wpr = ifelse(pr_sys = "Yes",count.fields(women09))
Error in ifelse(pr_sys = "Yes", count.fields(women09)) :
  unused argument (pr_sys = "Yes")

ifelse(pr_sys="Yes",wpr=women09,0)
Error in ifelse(pr_sys = "Yes", wpr = women09, 0) :
  unused arguments (pr_sys = "Yes", wpr = women09)

Data:

dput(world[1:64, c("women09", "pr_sys")])

It would help a lot if you posted (part of) your data. Try dput(world[, c("women09", "pr_sys")]) and add the output to your question. It would also help to show the expected output, because it is currently not very clear what you are trying to do. It sounds like you just need to count the Yes/No values in the pr_sys column, but then you talk about comparing columns. The conditional operator for equality is == (not =).

Your dput output is incomplete.

@taylor112 Not sure this question makes sense. Can you clarify? You already have percentages in the "pr_sys" column, so what is the goal?
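The point in the comments about == versus = can be illustrated with a minimal base R sketch. It uses the seven sample rows that appear in the answer; the names is_yes, women_if_yes and counts are made up for illustration and are not from the question.

```r
# Seven sample rows with the same column names as the question
world <- data.frame(
  women09 = c(NA, NA, 20.5, NA, 18.5, 8.3, 20.8),
  pr_sys  = c("No", "Yes", "No", "Yes", "Yes", "No", "No")
)

# `==` compares element by element and returns TRUE/FALSE.
# Inside a call, `pr_sys = "Yes"` would instead be read as an argument
# named pr_sys, which ifelse() does not have (hence "unused argument").
is_yes <- world$pr_sys == "Yes"

# ifelse(test, yes, no) needs all three arguments
women_if_yes <- ifelse(is_yes, world$women09, 0)

# Count how many rows fall into each pr_sys category
counts <- table(world$pr_sys)
print(counts)
```

Rows where pr_sys is "Yes" but women09 is missing stay NA in women_if_yes, so any later sum or mean would still need na.rm = TRUE.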
The data from dput(world[1:64, c("women09", "pr_sys")]), assigned to w:

w <- structure(list(women09 = c(NA, NA, 20.5, NA, 18.5, 8.3, 20.8, 
NA, NA, NA, 25, NA, NA, 18.1, 0.9, NA, 0, NA, NA, NA, NA, NA, 
NA, 18.6, NA, 10.5, NA, NA, 4.3, 4.1, NA, NA, NA, NA, 12, NA, 
19, NA, 7, 10.2, NA, NA, NA, NA, NA, 8.4, 3, NA, 8.9, NA, 15.2, 
10.8, 30, 10.7, NA, 18.2, 3.8, NA, NA, 16.4, NA, NA, 4, 13.2),
pr_sys = c("No", "Yes", "No", "Yes", 
"Yes", "No", "No", "No", "No", "No", "No", "No", "No", "Yes", 
"No", "No", "No", "No", "No", "No", "Yes", "No", "Yes", "No", 
"No", "No", "No", "No", "No", "No", NA, NA, "No", NA, "Yes", 
"Yes", "Yes", "No", "No", "No", "No", NA, "No", "No", NA, "No", 
"No", "No", "No", "Yes", "No", "Yes", "Yes", "No", "No", "Yes", 
"No", NA, "No", "No", "No", "Yes", "No", "Yes")), 
class = "data.frame", row.names = c(NA, -64L))
install.packages("tidyverse"); library(tidyverse) #since you said you're new to R
world <- as.data.frame(
  list(women09 = c(NA, NA, 20.5, NA, 18.5, 8.3, 20.8),
       pr_sys = c("No", "Yes", "No", "Yes","Yes", "No", "No"))
  )

world %>%
  group_by(pr_sys) %>%
  summarize(tot = sum(women09, na.rm = TRUE))

# A tibble: 2 x 2
pr_sys   tot
<fct>  <dbl>
1 No      49.6
2 Yes     18.5
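
The summary above totals women09 within each pr_sys group. If what is wanted is the average percentage per group, which is only one possible reading of the question, a base R sketch without extra packages might look like this. Using mean() is an assumption about the asker's goal, and the names means and totals are illustrative.

```r
# Same seven sample rows as in the answer above
world <- data.frame(
  women09 = c(NA, NA, 20.5, NA, 18.5, 8.3, 20.8),
  pr_sys  = c("No", "Yes", "No", "Yes", "Yes", "No", "No")
)

# tapply() splits women09 by the values of pr_sys and applies a function
# to each piece; na.rm = TRUE is passed on so missing values are skipped
means  <- tapply(world$women09, world$pr_sys, mean, na.rm = TRUE)
totals <- tapply(world$women09, world$pr_sys, sum,  na.rm = TRUE)

print(means)
print(totals)
```

The totals computed this way should match the tot column from the group_by() and summarize() version on the same rows.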