R 加权平均加总
我有下面的数据框R 加权平均加总,r,aggregate,R,Aggregate,我有下面的数据框 library(ggplot2) v = c(100,50,50,20,30,100,100,100,40,5) s = c(1,10, 5, 90,40, 100, 1,1,30,5) g = c(rep("A",5),rep("B",5)) m = c (1,1,5,5,10, 10,10,8,5,2) d = data.frame(value = v, size= s, group = g, m = m) d a = aggregate(d$v, by = list(d
library(ggplot2)
v = c(100,50,50,20,30,100,100,100,40,5)
s = c(1,10, 5, 90,40, 100, 1,1,30,5)
g = c(rep("A",5),rep("B",5))
m = c (1,1,5,5,10, 10,10,8,5,2)
d = data.frame(value = v, size= s, group = g, m = m)
d
a = aggregate(d$v, by = list(d$m, d$g), FUN= mean)
a
ggplot(data=a, aes(x=Group.1, y=x, group=Group.2)) +
geom_line( )+geom_point()
当前,aggregate()取每个“g”和“m”组中的平均值“v”
而不是简单的平均值,我想用“s”和每个“g”和“m”作为权重绘制加权平均值。例如,当g=A和m=1时,聚合返回75,平均值为100&50,但我想返回加权平均值(100*1+50*10)/11=54
最好的办法是什么
谢谢我们可以使用
dplyr
library(dplyr)
d %>%
group_by(m, group) %>%
summarise(vmean = floor(weighted.mean(value, size)))
# m group vmean
# <dbl> <fctr> <dbl>
#1 1 A 54
#2 2 B 5
#3 5 A 21
#4 5 B 40
#5 8 B 100
#6 10 A 30
#7 10 B 100
by(d[c("value", "size")], list(d$group, d$m),
FUN = function(x) weighted.mean(x[,1], x[,2]))