R 如何按组计算相关性

R 如何按组计算相关性,r,R,我正在尝试运行一个迭代for循环来计算因子变量水平的相关性。我的数据集中有32个团队,每个团队有16行数据。我想将每年的积分与每个团队的分数分别关联起来。我可以一个接一个地做,但我想在循环方面做得更好 correlate <- data %>% select(Team, Year, Points_Game) %>% filter(Team == "ARI") %>% select(Year, Points_Game) 使用[i]迭代所有32个团队,以获得

我正在尝试运行一个迭代for循环来计算因子变量水平的相关性。我的数据集中有32个团队,每个团队有16行数据。我想将每年的积分与每个团队的分数分别关联起来。我可以一个接一个地做,但我想在循环方面做得更好

correlate <- data %>%
  select(Team, Year, Points_Game) %>% 
  filter(Team == "ARI") %>% 
  select(Year, Points_Game)
使用[i]迭代所有32个团队,以获得每个团队的年度和积分相关性,这将非常有帮助

require(dplyr)
require(dplyr)

# dummy data
data = data.frame(
  Team = sapply(1:32, function(x) paste0("T", x)),
  Year = rep(c(2000:2009), 32),
  Points_Game = rnorm(320, 100, 10)
)

# find correlation of Year and Points_Game for each team
# r - correlation coefficient
correlate <- data %>%
                group_by(Team) %>% 
                summarise(r = cor(Year, Points_Game))
#虚拟数据 data=data.frame( Team=sapply(1:32,函数(x)0(“T”,x)), 年份=代表(c(2000:2009),32), 点数=规则(32010010) ) #找出每支球队的年度和比赛分数的相关性 #r-相关系数 关联% 组别(组别)%>% 总结(r=cor(年份、分数)
数据表方式:

library(data.table)

# dummy data (same as @Aleksandr's)
dat <- data.table(
  Team = sapply(1:32, function(x) paste0("T", x)),
  Year = rep(c(2000:2009), 32),
  Points_Game = rnorm(320, 100, 10)
)

# find correlation of Year and Points_Game for each Team
result <- dat[ , .(r = cor(Year, Points_Game)), by = Team]
库(data.table)
#虚拟数据(与@Aleksandr的相同)

dat请使用
dput()
(不是
str
head
或图片/屏幕截图)共享您的数据样本,以便其他人可以提供帮助。在这里看到更多,我试图找出如何正确使用dput。这太长了,无法在问题中发布…抱歉,我是新手。
library(data.table)

# dummy data (same as @Aleksandr's)
dat <- data.table(
  Team = sapply(1:32, function(x) paste0("T", x)),
  Year = rep(c(2000:2009), 32),
  Points_Game = rnorm(320, 100, 10)
)

# find correlation of Year and Points_Game for each Team
result <- dat[ , .(r = cor(Year, Points_Game)), by = Team]