R: filter out these words

I want to filter the common words listed in rm_words out of critic_smart and draw a bar chart of the 20 most frequent remaining words. Here is my code:

rm_words <- c("animal", "crossing", "horizons", "game", "nintendo", 
              "switch", "series", "island")
p1 <- critic_smart %>%
  filter((str_detect(word, rm_words))) %>%
  count(word) %>%
  slice_max(n, n = 20) %>% 
  ggplot(aes(x = n, y = fct_reorder(word, n))) +
  geom_col() +
  labs(x = "Frequency of words", y = "")
p1

This code counts only the words that are in rm_words, but what I actually want is to remove those words. How can I filter them out? Thanks.

If the words in critic_smart only ever match the words in rm_words exactly, you can test for membership in the vector:

filter(!(word %in% rm_words))
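The exact-match filter above can be tried on a small stand-in data set. This is only a sketch: the real critic_smart was never posted, so the data frame below is a hypothetical substitute with a single `word` column.

```r
library(dplyr)

# Hypothetical stand-in for critic_smart (the real data was not shared):
# one token per row in a column named `word`.
critic_smart <- data.frame(
  word = c("island", "fun", "game", "relaxing", "fun", "switch"),
  stringsAsFactors = FALSE
)

rm_words <- c("animal", "crossing", "horizons", "game", "nintendo",
              "switch", "series", "island")

# Keep only rows whose word is NOT one of rm_words, then count what is left.
kept <- critic_smart %>%
  filter(!(word %in% rm_words)) %>%
  count(word, sort = TRUE)

print(kept)
```

Only "fun" and "relaxing" survive here, because `%in%` compares whole strings and every other token is listed in rm_words.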

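If the words in critic_smart$word may contain a substring equal to something in rm_words, and you want to remove those words as well, you can turn the rm_words vector into a single regular expression pattern for use with grepl or str_detect:

filter(!grepl(paste(rm_words, collapse = "|"), word))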
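A minimal sketch of the substring approach, again on made-up data (the tokens below are assumptions chosen for illustration, not taken from the real critic_smart):

```r
library(dplyr)

# Hypothetical tokens; "islands" and "gameplay" merely CONTAIN an rm_words entry.
critic_smart <- data.frame(
  word = c("islands", "fun", "gameplay", "relaxing", "switch"),
  stringsAsFactors = FALSE
)

rm_words <- c("animal", "crossing", "horizons", "game", "nintendo",
              "switch", "series", "island")

# Collapse the vector into one alternation: "animal|crossing|...|island".
pattern <- paste(rm_words, collapse = "|")

# Drop every row whose word matches the pattern anywhere in the string.
kept <- critic_smart %>%
  filter(!grepl(pattern, word))

print(kept)
```

Unlike the `%in%` test, this also removes "islands" and "gameplay". If only whole-word matches should be dropped, the `%in%` version is the safer choice. The words here are plain letters, so no regex escaping is needed; entries containing characters such as `.` or `(` would need escaping first.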

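Can you give a reproducible example? We have no way of knowing what critic_smart is. It looks like you could try !str_detect(word, rm_words) instead of str_detect(word, rm_words).

I have already tried !str_detect(word, rm_words). It doesn't work, and the words are not filtered out by this function.

OK, so please provide a reproducible example.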
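One likely reason the negated str_detect call does not help (an assumption, since critic_smart was never shared) is that str_detect is vectorised over both string and pattern: a vector of patterns is compared with the words pair by pair, not "each word against every pattern". The small reproducible sketch below uses invented data to show the difference.

```r
library(stringr)

rm_words <- c("animal", "crossing", "horizons", "game", "nintendo",
              "switch", "series", "island")

# Hypothetical tokens: every one of them is in rm_words, just in reverse order.
word <- rev(rm_words)

# Pairwise comparison: word[1] vs rm_words[1], word[2] vs rm_words[2], ...
pairwise <- str_detect(word, rm_words)

# A single collapsed pattern tests each word against all alternatives.
collapsed <- str_detect(word, paste(rm_words, collapse = "|"))

print(pairwise)
print(collapsed)
```

In this sketch the pairwise call reports no matches at all, even though every token is in rm_words, while the collapsed pattern matches every token. When the two vectors differ in length, the pattern may be recycled or the call may raise an error, depending on the stringr version.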