Trying to aggregate reaction types in R


Sample of the data

Activity          ReactionType   numberActivities
activator         activates      16
binding           binds          83
recombinase       binds          1
branching         branches       3
carboxylase       carboxylates   36
peptidase         cleaves        425
endopeptidase     cleaves        368
nuclease          cleaves        53
glycosylase       cleaves        24
cyclase           converts       12
transhydrogenase  converts       3
hist deacetylase  deacetylates   8
deacetylase       deacetylates   16

I want to group all rows that share the same reaction type and sum their counts:

reaction_types <- aggregate(numberActivities ~ ReactionType, unique(outAct), FUN = sum)

The problem is that I get duplicates: some rows are not treated as one unique reaction type. For example, the output contains these lines:

deacetylates    8
deacetylates    16

There are more cases like this throughout the output file. What am I doing wrong?

Thanks in advance.

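Do any of your strings contain whitespace? What is the output of as.character(unique(outAct$ReactionType))?

Alistaire, how can I check for whitespace? Jean, I don't understand your comment. Do you mean in the aggregate function above? The output comes from that function applied to outAct.

Adiana, library dplyr doesn't seem to work in my environment, and it doesn't recognize the statement out %>% group_by(ReactionType) %>% summarise(number=sum(numberActivities)): Error in eval(expr, envir, enclos) : object 'out' not found

How do I strip the whitespace from my outAct$ReactionType column?

paste(reaction_types) [1] "c(\"phosphorylates\", \"cleaves\", \"transfers\", \"transports\", \"hydrolyses\", \"dephosphorylates\", \"reduces\", \"dehydrogenates\", \"ligates\", \"exchanges(nucleotides)\", \"clamp loader binds\", \"exchanges(nucleotides)\", \"binds\", ... Wait, so that is the problem: some values have a space and a backslash, others just a backslash. How do I clean this up?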
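One way to check for stray whitespace, sketched on a made-up miniature of outAct (the rows and the padding below are assumptions for illustration, not the real data). The backslashes in a paste() result are most likely just how R prints quote characters inside a string, so the padding is probably the part that matters.

```r
# Hypothetical stand-in for outAct; the padded values are deliberate.
outAct <- data.frame(
  Activity = c("peptidase", "nuclease", "deacetylase", "hist deacetylase"),
  ReactionType = c("cleaves", " cleaves", "deacetylates", "deacetylates "),
  numberActivities = c(425, 53, 16, 8),
  stringsAsFactors = FALSE
)

# dput() prints each value inside quotes, so leading or trailing
# spaces become visible.
dput(unique(as.character(outAct$ReactionType)))

# Flag the values that start or end with whitespace.
padded <- grepl("^\\s|\\s$", as.character(outAct$ReactionType))
outAct[padded, ]
```

If any rows are flagged, values that look identical on screen are different strings to R, which would explain why they end up in separate groups.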
library(dplyr)
outAct %>% group_by(ReactionType) %>% summarise(number = sum(numberActivities)) 
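Grouping alone will not merge values that differ only by surrounding whitespace, so the column probably needs trimming first. A minimal base R sketch, assuming the cause really is padding; the small data frame is a hypothetical stand-in for outAct, and trimws() needs R 3.2.0 or later.

```r
# Hypothetical stand-in for outAct; the padded values are deliberate.
outAct <- data.frame(
  Activity = c("peptidase", "nuclease", "deacetylase", "hist deacetylase"),
  ReactionType = c("cleaves", " cleaves", "deacetylates", "deacetylates "),
  numberActivities = c(425, 53, 16, 8),
  stringsAsFactors = FALSE
)

# Strip leading and trailing whitespace; as.character() also covers
# the case where the column is a factor.
outAct$ReactionType <- trimws(as.character(outAct$ReactionType))

# Sum the counts per cleaned reaction type.
reaction_types <- aggregate(numberActivities ~ ReactionType,
                            data = outAct, FUN = sum)
reaction_types
```

This sketch passes outAct directly rather than unique(outAct): unique() drops fully identical rows before summing, which may or may not be intended. After the trimming step, the same cleaned column should also work with the dplyr version.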