将情绪列添加到r中的数据集
我在r中做了一些基本的情绪分析,想知道是否有一种方法可以分析句子或行的情绪,然后在列中附加句子的情绪。到目前为止,我所做的所有分析都给了我一个关于情绪的概述,或是提取了一些特定的词语,但没有链接回原始数据行 我的数据输入将通过BI软件输入,并显示如下,带有案例编号和一些文本:将情绪列添加到r中的数据集,r,data-analysis,data-science,sentiment-analysis,R,Data Analysis,Data Science,Sentiment Analysis,我在r中做了一些基本的情绪分析,想知道是否有一种方法可以分析句子或行的情绪,然后在列中附加句子的情绪。到目前为止,我所做的所有分析都给了我一个关于情绪的概述,或是提取了一些特定的词语,但没有链接回原始数据行 我的数据输入将通过BI软件输入,并显示如下,带有案例编号和一些文本: "12345","I am extremely angry with my service" "23456","I was happy with how everything turned out" "34567","Th
"12345","I am extremely angry with my service"
"23456","I was happy with how everything turned out"
"34567","The rep did a great job helping me"
我希望它能作为下面的输出返回
"12345","I am extremely angry with my service","Anger"
"23456","I was happy with how everything turned out","Positive"
"34567","The rep did a great job helping me","Positive"
任何一点在一个包或资源的正确方向将不胜感激 你在句子中遇到的问题是情感词汇是基于单词的。如果你看看nrc的词汇,“愤怒”一词有三个情感价值:愤怒、厌恶和消极。你选择哪一个?或者你让这个句子返回一个词典中的多个单词。试着用你的文本测试不同的词汇,看看会发生什么,例如
tidytext
如果你想要一个能够在句子层面上分析情绪的软件包,你可以查看感伤器
。你不会得到像愤怒这样的情绪值,而是情绪/极性得分。有关感伤者的更多信息,请参见github页面上的和
一个小示例代码:
library(sentimentr)
text <- data.frame(id = c("12345","23456","34567"),
sentence = c("I am extremely angry with my service", "I was happy with how everything turned out", "The rep did a great job helping me"),
stringsAsFactors = FALSE)
sentiment(text$sentence)
element_id sentence_id word_count sentiment
1: 1 1 7 -0.5102520
2: 2 1 8 0.2651650
3: 3 1 8 0.3535534
# add sentiment score to data.frame
text$sentiment <- sentiment(text$sentence)$sentiment
text
id sentence sentiment
1 12345 I am extremely angry with my service -0.5102520
2 23456 I was happy with how everything turned out 0.2651650
3 34567 The rep did a great job helping me 0.3535534
库(感伤器)
文本感谢您在正确的方向推动!我想这就是我要找的。