How to match similar documents in R_R_Twitter_Text Mining - Fatal编程技术网

How to match similar documents in R

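I have created two corpora: one containing tweet text and the other containing company names. What I want to do is find out which companies are mentioned in the tweets.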

A sample document from the tweet corpus:

> writeLines(as.character(tweet_corp[[175]]))
general motor send mexican made model chevi cruze us car dealer tax free across border make usaor pay big border tax

A sample document from the company corpus:

> writeLines(as.character(company_corp[[1397]]))
general motor

I would like output that matches tweet_corp[[175]] with company_corp[[1397]]. Is there a way to do this?

You can use the stringr package to check whether a company name occurs in a tweet, for example:

library(stringr)

company_name <- "general motor"

tweet <- "general motor send mexican made model chevi cruze us car dealer tax free across border make usaor pay big border tax"

# check whether a company name occurs in a string
str_detect(
  string = tweet,
  pattern = coll(company_name)
)
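
The check above covers a single tweet and a single company name. To get output that pairs every tweet with every company it mentions, the same idea can be applied to all combinations. The following is only a rough sketch: `tweets` and `companies` are hypothetical character vectors standing in for the two corpora (which would first need to be converted to plain strings), `match_companies` is a made-up helper name, and base R's `grepl(..., fixed = TRUE)` is used in place of `str_detect()` so the example has no package dependencies.

```r
# Hypothetical stand-ins for the two corpora as plain character vectors
tweets <- c(
  "general motor send mexican made model chevi cruze us car dealer tax free across border make usaor pay big border tax",
  "toyota motor plan new plant"
)
companies <- c("general motor", "toyota motor", "ford motor")

# Return one row per (tweet, company) pair where the name occurs in the tweet
match_companies <- function(tweets, companies) {
  # Every combination of tweet index and company index
  pairs <- expand.grid(
    tweet_id = seq_along(tweets),
    company_id = seq_along(companies)
  )
  # fixed = TRUE treats the company name as a literal string, not a regex
  found <- vapply(
    seq_len(nrow(pairs)),
    function(i) {
      grepl(companies[pairs$company_id[i]], tweets[pairs$tweet_id[i]], fixed = TRUE)
    },
    logical(1)
  )
  # Keep only the matching pairs and attach the company name
  result <- pairs[found, , drop = FALSE]
  result$company <- companies[result$company_id]
  rownames(result) <- NULL
  result
}

matched <- match_companies(tweets, companies)
print(matched)
```

Note that this is plain substring matching, so a name such as "general motor" would also match inside a longer word like "motorcycle". Checking every pair also scales with the number of tweets times the number of companies, which may be slow for large corpora.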