R: How do I scrape selected list items from a web page?


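I am trying to scrape the characters (featured, supporting, antagonists, etc.) of Marvel movies. The characters sit in lists in the DOM, and I cannot find the right html_nodes() selector to get all the list items under each character type.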

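The code below extracts every link that appears in a list, whereas I only want the links that belong to Featured Characters, Supporting Characters, Antagonists and Other Characters (the last one does not apply to X2).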

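library(rvest)
library(tidyverse)

# The assignment of test_url (the page address and how it was read)
# was lost from the original post.
test_url %>%
  html_nodes("li>a") %>%
  html_text()

Expected result: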

# A tibble: 16 x 3
   movie type                  character                  
   <chr> <chr>                 <chr>                      
 1 X2    Featured Characters   Professor Charles Xavier   
 2 X2    Featured Characters   Wolverine (Logan)          
 3 X2    Featured Characters   Storm (Ororo Munroe)       
 4 X2    Featured Characters   Dr. Jean Grey              
 5 X2    Featured Characters   Cyclops (Scott Summers)    
 6 X2    Featured Characters   Rogue (Marie)              
 7 X2    Featured Characters   Iceman (Bobby Drake)       
 8 X2    Supporting Characters Nightcrawler (Kurt Wagner) 
 9 X2    Supporting Characters Pyro (John Allerdyce)      
10 X2    Supporting Characters Mystique (Raven Darkholme) 
11 X2    Supporting Characters Magneto (Erik Lehnsherr)   
12 X2    Antagonists           Col. William Stryker       
13 X2    Antagonists           Sgt. Lyman                 
14 X2    Antagonists           Unnamed Soldiers           
15 X2    Antagonists           Deathstrike (Yuriko Oyama) 
16 X2    Antagonists           Mutant 143 (Jason Stryker)

You can start with the following:

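library(rvest)
library(tidyverse)

# The definition of test_url and the node selection were lost from the
# original post; only the final html_text() call survives.
test_url %>%
  html_text()

# Format the scraped data into the desired format.
# The construction of df before this step was also lost from the original post.
df %>%
  separate_rows(characters, sep = "\\n")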

> head(df)
      movie                type                         characters
1 X2_(film) Featured Characters                             X-Men 
2 X2_(film) Featured Characters          Professor Charles Xavier 
3 X2_(film) Featured Characters                 Wolverine (Logan) 
4 X2_(film) Featured Characters              Storm (Ororo Munroe) 
5 X2_(film) Featured Characters   Dr. Jean Grey   (Apparent death)
6 X2_(film) Featured Characters           Cyclops (Scott Summers) 
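
One possible way to restrict the scrape to the wanted sections is to anchor on each heading and take only the list that immediately follows it. The sketch below is not the code from the answer above. It assumes each character type is an h3 heading followed by a ul, and it runs against a small hypothetical HTML snippet rather than the real page, whose markup may differ. The names page and wanted, and the hard-coded movie label, are illustrative.

```r
library(rvest)
library(tidyverse)

# Hypothetical stand-in for the wiki page (assumption: h3 heading, then ul).
page <- read_html('
<html><body>
  <h3>Featured Characters</h3>
  <ul>
    <li><a href="#">Professor Charles Xavier</a></li>
    <li><a href="#">Wolverine (Logan)</a></li>
  </ul>
  <h3>Supporting Characters</h3>
  <ul>
    <li><a href="#">Nightcrawler (Kurt Wagner)</a></li>
  </ul>
  <h3>Locations</h3>
  <ul>
    <li><a href="#">Alkali Lake</a></li>
  </ul>
</body></html>')

# Section headings to keep; anything else (e.g. Locations) is ignored.
wanted <- c("Featured Characters", "Supporting Characters",
            "Antagonists", "Other Characters")

# For each heading, select the links in the first ul that follows it.
df <- map(wanted, function(type) {
  xpath <- sprintf(
    "//h3[normalize-space() = '%s']/following-sibling::ul[1]/li/a", type
  )
  tibble(
    movie = "X2",
    type = type,
    character = page %>% html_nodes(xpath = xpath) %>% html_text(trim = TRUE)
  )
}) %>%
  bind_rows()

print(df)
```

Headings that are missing from the page, such as Other Characters here, simply contribute zero rows. Because the sketch reads only the link text inside each li, group labels or notes that are plain text rather than links would not be picked up, which may or may not match what the real page needs.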