R中的基础数据检查和表创建
我正在研究一个新冠病毒感染者的数据集。数据集由2010名个人和四列组成:R中的基础数据检查和表创建,r,R,我正在研究一个新冠病毒感染者的数据集。数据集由2010名个人和四列组成: 死者:如果此人死于电晕(1:是,0:否) 性别:男/女 年龄:人的年龄(从2岁到99岁) 国家:此人来自哪个国家(法国、日本、韩国、印度尼西亚) 我想为你做桌子 每个国家的死亡人数 每种性别的死亡人数,以及 每个国家:死亡人数,按性别分列 除了通过手动计数然后在矩阵中设置值来查找信息外,我还没有找到任何好的方法,这是不可行的。可以使用以下代码检索数据: id = "1CA1RPRYqU9oTIaHfSroitnWr
- 死者:如果此人死于电晕(1:是,0:否)
- 性别:男/女
- 年龄:人的年龄(从2岁到99岁)
- 国家:此人来自哪个国家(法国、日本、韩国、印度尼西亚)
- 每个国家的死亡人数
- 每种性别的死亡人数,以及
- 每个国家:死亡人数,按性别分列
id = "1CA1RPRYqU9oTIaHfSroitnWrI6WpUeBw"
d.corona = read.csv(sprintf("https://docs.google.com/uc?id=%s&export=download",id),header = T)
1) 每个国家的死亡人数
d.corona %>% group_by(country) %>% summarise(number_of_deceased = sum(deceased))
# country number_of_deceased
# <fct> <int>
#1 France 14
#2 indonesia 2
#3 japan 3
#4 Korea 26
d.corona%>%按(国家)分组%>%总结(死亡人数=总和(死亡))
#死亡人数的国家编号
#
#1法国14
#2印度尼西亚2
#3日本3
#4韩国26
2) 每种性别的死亡人数
d.corona %>% group_by(sex) %>% summarise(number_of_deceased = sum(deceased))
# sex number_of_deceased
# <fct> <int>
#1 female 14
#2 male 31
d.corona %>%
group_by(country, sex) %>%
summarise(number_of_deceased = sum(deceased))
# country sex number_of_deceased
# <fct> <fct> <int>
#1 France female 5
#2 France male 9
#3 indonesia female 1
#4 indonesia male 1
#5 japan female 0
#6 japan male 3
#7 Korea female 8
#8 Korea male 18
d.corona%>%按性别分组%>%总结(死亡人数=死亡总数))
#死者的性别数字
#
#1女14
#2男31
3) 每个国家:死亡人数,按性别分列
d.corona %>% group_by(sex) %>% summarise(number_of_deceased = sum(deceased))
# sex number_of_deceased
# <fct> <int>
#1 female 14
#2 male 31
d.corona %>%
group_by(country, sex) %>%
summarise(number_of_deceased = sum(deceased))
# country sex number_of_deceased
# <fct> <fct> <int>
#1 France female 5
#2 France male 9
#3 indonesia female 1
#4 indonesia male 1
#5 japan female 0
#6 japan male 3
#7 Korea female 8
#8 Korea male 18
d.corona%>%
按(国家、性别)分组%>%
总结(死亡人数=死亡总数)
#国家性别死亡人数
#
#1法国女5
#2法国男9
#3印度尼西亚女1
#4印度尼西亚男1
#5日本女性0
#6日本男3
#7韩国女8
#8韩国男18
或者可能:
d.corona %>%
group_by(country) %>%
summarise(number_of_deceased_male = sum(deceased[sex == 'male']),
number_of_deceased_female = sum(deceased[sex == 'female']))
# country number_of_deceased_male number_of_deceased_female
# <fct> <int> <int>
#1 France 9 5
#2 indonesia 1 1
#3 japan 3 0
#4 Korea 18 8
d.corona%>%
按(国家)划分的组别%>%
总结(死亡人数男性=总和(死亡[性别='男性]),
死亡人数女性=总数(死亡[性别='女性])
#国家/地区死亡人数男性死亡人数女性
#
#1法国9 5
#2印度尼西亚1 1
#3日本3 0
#4韩国18 8