Coloring the vertices of a graph in R (igraph)
I have a database containing all users and their roles, and from it I built a table with usera, role, userb. I am trying to give each user a color based on their team. The SQL below can be used to create a similar data set:
select concat('H',ABS(Checksum(NewID()) % 999999)) acentralacc, CONCAT('Role_',CHAR( FLOOR(65 + (RAND() * 25))))role,concat('H',ABS(Checksum(NewID()) % 999999))bcentralacc
select concat('H',ABS(Checksum(NewID()) % 999999)) centralaccountname, CONCAT('Team_',CHAR( FLOOR(65 + (RAND() * 25))))team
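Comments:

You haven't provided sample data, and we don't have access to your SQL Server database, so we cannot reproduce your environment. However, it looks to me as if V(net)$Team is a string, not a factor, so it cannot be used as an index into the vector colr. You need the team as a number or a factor.

Is there a good way for me to provide a sample data set without sharing the whole database/table? I tried to include a head() in the post so you can see what the data looks like, but the full table has 10 million records.

I don't know your data, but instead of sqlQuery = "SELECT top 100000 * FROM MyTable1", how about sqlQuery = "SELECT top 100 * FROM MyTable1"?

Yes, I select about 100k records to get a representative view; selecting the top 100 would give the same data as head(). The reason I am not keen on sharing much data is that it is more or less user data.

OK, then how about sharing the output of str(users)?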
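The point raised in the first comment can be checked with a minimal base R sketch. The team labels here are made-up stand-ins for V(net)$Team, not the asker's data, and the palette size is chosen only for illustration. Indexing an unnamed vector with strings yields NA, while indexing with the integer codes of a factor picks one color per team.

```r
# Hypothetical team labels standing in for V(net)$Team
team <- c("Incasso", "Financieel Beheer", "Incasso")

# An unnamed palette with one entry per distinct team
pal <- rainbow(length(unique(team)), alpha = 0.5)

# A character index on an unnamed vector finds no matching names
by_string <- pal[team]

# A factor's integer codes (1..k) are valid positions in the palette
by_factor <- pal[as.integer(factor(team))]

print(by_string)
print(by_factor)
```

With real data the palette needs at least as many entries as there are distinct teams, otherwise the higher factor codes again return NA.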
# load libraries
install.packages("igraph")
library("igraph")
#edges (connection between users)
connStr <- "Driver={SQL Server};MyCon;Trusted_Connection=TRUE"
dsSqlServerData <- RxSqlServerData(sqlQuery = "SELECT top 100000 * FROM MyTable1", connectionString = connStr)
data <- rxDataStep(dsSqlServerData)
head(data)
acentralacc rol bcentralacc
1 H000062 ADDN_Basis_BBE_intern H000079
2 H000062 ADDN_Basis_BBE_intern H000082
3 H000062 ADDN_Basis_BBE_intern H000092
4 H000062 ADDN_Basis_BBE_intern H000170
5 H000062 ADDN_Basis_BBE_intern H000197
6 H000062 ADDN_Basis_BBE_intern H000233
data$rol <- 1
head(data)
acentralacc rol bcentralacc
1 H000062 1 H000079
2 H000062 1 H000082
3 H000062 1 H000092
4 H000062 1 H000170
5 H000062 1 H000197
6 H000062 1 H000233
data1 <- aggregate(data[,2],data[,-2],sum)
data1 <- data1[order(data1$acentralacc,data1$bcentralacc),]
head(data1)
acentralacc bcentralacc x
1 H000062 H000062 58
9 H000062 H000067 15
17 H000062 H000071 17
25 H000062 H000073 13
33 H000062 H000077 11
41 H000062 H000079 13
# vertices (ID, node attributes)
connStr <- "Driver={SQL Server};MyCon;Trusted_Connection=TRUE"
dsSqlServerData <- RxSqlServerData(sqlQuery = "SELECT distinct CentralAccount,isnull(Team,'No Team') Team FROM MyTable2", connectionString = connStr)
users <- rxDataStep(dsSqlServerData)
head(users)
CentralAccount Team
1 H000062 ICT Customer Services
2 H000067 Financieel Beheer
3 H000070 Acceptatie Team 2 Gent
4 H000071 Acceptatie Team 1 Gent
5 H000073 NL Ond UW & Risk Advice Auto
6 H000076 Incasso
#parsing (converting) users team
users$Team <- iconv(users$Team, "ASCII", "UTF-8", sub="")
net <- graph_from_data_frame(d=data1, vertices=users, directed=T)
#the following should be igraph
class(net)
# remove self-loops but keep multiple edges
net <- simplify(net, remove.multiple = FALSE, remove.loops = TRUE)
# create color palette
pal2 <- rainbow(5, alpha=.5)
V(net)$color <- pal2[V(net)$Team]
plot(delete.vertices(simplify(net), degree(net)==0),vertex.size=5,vertex.label=NA,edge.arrow.size=.2)
legend("topleft", unique(c(V(net)$Team)), pch=21,col="#777777", pt.cex=2, cex=.8, bty="n", ncol=1)
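# needs library(RColorBrewer); the "Accent" palette has at most 8 colors,
# so this only works when there are no more than 8 distinct teams
colrs <- brewer.pal(length(unique(V(net)$Team)), "Accent")
# name each color after a team so the palette can be indexed by team name
names(colrs) <- unique(V(net)$Team)
V(net)$color <- colrs[V(net)$Team]
V(net)$color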
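install.packages("RColorBrewer")
library(RColorBrewer)
# number of distinct teams, i.e. number of colors needed
colourCount <- length(unique(users$Team))
# interpolate the 9 "Set1" colors so any number of colors can be generated
getPalette <- colorRampPalette(brewer.pal(9, "Set1"))
colrs <- getPalette(colourCount)
# name each color after a team, then look colors up by team name
names(colrs) <- unique(V(net)$Team)
V(net)$color <- colrs[V(net)$Team]
V(net)$color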
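
The named-palette approach above can be tried end to end on a toy graph. This is only a sketch under stated assumptions: the edges and users data frames are invented stand-ins for data1 and users, the igraph package is assumed to be installed, and base colorRampPalette() with three arbitrary anchor colors replaces the RColorBrewer palette so that no extra package is needed.

```r
library(igraph)

# Hypothetical toy data standing in for data1 (edges) and users (vertices)
edges <- data.frame(
  acentralacc = c("H1", "H1", "H2"),
  bcentralacc = c("H2", "H3", "H3"),
  x = c(3, 1, 2)
)
users <- data.frame(
  CentralAccount = c("H1", "H2", "H3"),
  Team = c("Incasso", "Financieel Beheer", "Incasso"),
  stringsAsFactors = FALSE
)

net <- graph_from_data_frame(d = edges, vertices = users, directed = TRUE)

# One color per distinct team, named by team so a character lookup works
teams <- sort(unique(V(net)$Team))
colrs <- setNames(colorRampPalette(c("red", "gold", "blue"))(length(teams)), teams)

# Look up each vertex's color by its team name
V(net)$color <- unname(colrs[V(net)$Team])

plot(net, vertex.size = 20, vertex.label = NA, edge.arrow.size = 0.2)

# pt.bg fills the legend symbols with the team colors
legend("topleft", legend = names(colrs), pch = 21, pt.bg = colrs,
       col = "#777777", pt.cex = 2, cex = 0.8, bty = "n", ncol = 1)
```

Passing the palette to pt.bg makes the legend symbols match the vertex colors, which the legend call in the question does not do.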
str(users)
'data.frame': 2481 obs. of 2 variables:
$ CentralAccount: chr "H000062" "H000067" "H000070" "H000071" ...
$ Team : chr "ICT Customer Services" "Financieel Beheer" "Acceptatie Team 2 Gent" "Acceptatie Team 1 Gent" ...