Word cloud: Error in strwidth(words[i], cex = size[i], ...) : invalid 'cex' value

I am following a word cloud tutorial, but I get:

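Error in strwidth(words[i], cex = size[i], ...) : invalid 'cex' value
In addition: Warning messages:
1: In max(freq) : no non-missing arguments to max; returning -Inf
2: In max(freq) : no non-missing arguments to max; returning -Inf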

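I don't fully understand what happens at each step of the code, but I think the problem may be that the resulting matrix has a different number of rows or columns than expected. This is the code I am using: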

install.packages(c("devtools", "rjson", "bit64", "httr"))

library(devtools)
install_github("twitteR", username="geoffjentry")
library(twitteR)

##
api_key= "xxxxxx"
api_secret= "xxxxxx"
access_token="xxxxxxxxxxxx"
access_token_secret= "xxxxxx"
setup_twitter_oauth(api_key,api_secret,access_token,access_token_secret)

searchTwitter("amlo")

library(twitteR)
install.packages("tm")
library(tm)
install.packages("wordcloud")
library(wordcloud)
library(RColorBrewer)

mh370 <- searchTwitter("#PrayForMH370", since = "2014-03-08", until = "2014-03-20", n = 1000)
mh370_text = sapply(mh370, function(x) x$getText())
mh370_corpus = Corpus(VectorSource(mh370_text))

tdm = TermDocumentMatrix(
  mh370_corpus,
  control = list(
    removePunctuation = TRUE,
    stopwords = c("prayformh370", "prayformh", stopwords("english")),
    removeNumbers = TRUE, tolower = TRUE)
)

m = as.matrix(tdm)
# get word counts in decreasing order
word_freqs = sort(rowSums(m), decreasing = TRUE) 
# create a data frame with words and their frequencies
dm = data.frame(word = names(word_freqs), freq = word_freqs)
wordcloud(dm$word, dm$freq, random.order = FALSE, colors = brewer.pal(8, "Dark2"))

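The problem is that the default behavior of the TermDocumentMatrix function from the tm package is to keep only words that are at least three characters long (the default is wordLengths = c(3, Inf)).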

So just add the argument wordLengths = c(0, Inf) to the control list of TermDocumentMatrix:

tdm = TermDocumentMatrix(
  mh370_corpus,
  control = list(
    wordLengths = c(0, Inf),
    removePunctuation = TRUE,
    stopwords = c("prayformh370", "prayformh", stopwords("english")),
    removeNumbers = TRUE, tolower = TRUE)
)
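
To see the effect of wordLengths without calling the Twitter API, here is a minimal sketch on a toy corpus. The two example sentences and the variable names (docs, tdm_default, tdm_all, freq_default, freq_all) are made up for illustration and are not part of the original code. It also adds a check for an empty frequency vector, since the "no non-missing arguments to max" warning means freq had no elements when wordcloud was called.

```r
library(tm)

# Toy documents (hypothetical data, not the tweets from the question)
docs <- c("go to MH", "we pray")
corpus <- Corpus(VectorSource(docs))

# Default control list: very short terms are dropped
tdm_default <- TermDocumentMatrix(corpus)

# Keep terms of any length
tdm_all <- TermDocumentMatrix(corpus, control = list(wordLengths = c(0, Inf)))

# Term frequencies under each setting
freq_default <- rowSums(as.matrix(tdm_default))
freq_all <- rowSums(as.matrix(tdm_all))

# Compare how many terms survive under each setting
length(freq_default)
length(freq_all)

# Guard: stop early if nothing is left to plot, instead of failing inside wordcloud()
if (length(freq_all) == 0) {
  stop("No terms left after preprocessing; check the input text and the control list")
}
```

If the frequency vector is still empty after adding wordLengths, it may be worth checking length(mh370) first: a search that returns no tweets would also leave nothing to count.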