rvest: trying to build a csv/table from a website

I am trying to build a table/data frame/csv file that looks like this:

( [City1, State1], OverallScore1, QualityOfLife1, Value1 )
( [City2, State2], OverallScore2, QualityOfLife2, Value2 )
...
( [CityN, StateN], OverallScoreN, QualityOfLifeN, ValueN )
For a single (City, State), I can only get one of the three values (Overall Score, Quality of Life, and Value), using this code:

library(rvest)
# Read the page once; the object name must match the one used in the pipe below
live <- read_html("https://realestate.usnews.com/places/rankings/best-places-to-live")
live %>%
  html_node('#main-well') %>%
  html_node('.text-large-for-small-only') %>%
  html_text()

Is there a way to get all of the above fields in one go?

Please use this code in accordance with the guidelines mentioned in the comments.

library(rvest)
url <- "https://realestate.usnews.com/places/rankings/best-places-to-live"
page <- read_html(url)

# Overall score: drop the " Overall Score" label and convert to numeric
overall_score <- html_nodes(page, css = ".text-tightest:nth-child(1) .text-coal") %>% html_text()
overall_score <- as.numeric(gsub(" Overall Score", "", overall_score))

# Quality of life: the character class strips whitespace and the label's letters
life_quality <- html_nodes(page, css = ".text-tightest:nth-child(2) .text-coal") %>% html_text()
life_quality <- as.numeric(gsub("[\r\n QualityofLife]", "", life_quality))

# Value: same approach, using the letters of the "Value" label
value <- html_nodes(page, css = ".border-left-for-medium-up+ .text-tightest .text-coal") %>% html_text()
value <- as.numeric(gsub("[\r\n Value]", "", value))

# Headings look like "City, State"; split them into two columns
heading <- html_nodes(page, css = ".heading-large a") %>% html_text()
city <- sapply(heading, function(x) strsplit(x, split = ", ")[[1]][1])
state <- sapply(heading, function(x) strsplit(x, split = ", ")[[1]][2])

# row.names = NULL discards the names that sapply() attaches to city and state
real_estate <- data.frame(city, state, overall_score, life_quality, value, row.names = NULL)
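
The code above collects each column with its own page-wide selector, so data.frame() only works if every selector returns the same number of matches. A minimal sketch of an alternative that extracts the fields one entry at a time and then writes the csv file the question asks for is shown below. It assumes rvest 1.0 or later (for minimal_html() and html_elements()). The markup, the class names (.card, .overall, .quality, .value), the scores, and the output file name are all made up for illustration and are not taken from the real page.

```r
library(rvest)

# Hypothetical stand-in for the ranking page: class names and scores are invented.
page <- minimal_html('
  <div class="card"><h3><a>City1, State1</a></h3>
    <span class="overall">1.1 Overall Score</span>
    <span class="quality">2.2 Quality of Life</span>
    <span class="value">3.3 Value</span></div>
  <div class="card"><h3><a>City2, State2</a></h3>
    <span class="overall">4.4 Overall Score</span>
    <span class="value">6.6 Value</span></div>
')

# One node per ranked place
cards <- html_elements(page, ".card")

# html_element() (singular) returns one result per card, missing if absent,
# so every column ends up with the same length
field <- function(css) html_text(html_element(cards, css), trim = TRUE)

# Keep only digits and the decimal point, then convert; NA stays NA
to_number <- function(x) as.numeric(gsub("[^0-9.]", "", x))

# Split "City, State" headings into two columns
parts <- strsplit(field("h3 a"), ", ", fixed = TRUE)

real_estate <- data.frame(
  city          = vapply(parts, `[`, character(1), 1),
  state         = vapply(parts, `[`, character(1), 2),
  overall_score = to_number(field(".overall")),
  life_quality  = to_number(field(".quality")),
  value         = to_number(field(".value")),
  stringsAsFactors = FALSE
)

# Write the table to a csv file (the location is an arbitrary choice)
out_file <- file.path(tempdir(), "best_places.csv")
write.csv(real_estate, out_file, row.names = FALSE)
```

In this sketch the second entry has no quality-of-life element, so its life_quality is NA rather than shifting the other rows. For the real page, the container and field selectors would need to be replaced with ones that match its actual markup.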

@hrbrmstr Which part would the above violate?
@hrbrmstr This data is for my personal use only. I am interested in compiling information about places I should consider.