如何使用for循环自动化Web垃圾处理
我有一个带有两列的df:id和url。id包含项目id,url包含我想用来抓取父项目id的网站链接。以下是我拥有的df示例: 以下是一个示例df:如何使用for循环自动化Web垃圾处理,r,dataframe,loops,for-loop,web-scraping,R,Dataframe,Loops,For Loop,Web Scraping,我有一个带有两列的df:id和url。id包含项目id,url包含我想用来抓取父项目id的网站链接。以下是我拥有的df示例: 以下是一个示例df: df <- structure(list(id = c("P173165", "P175875", "P175841", "P175730" ), url = c("https://search.worldbank.org/api/v2/projects?
df <- structure(list(id = c("P173165", "P175875", "P175841", "P175730"
), url = c("https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P173165&apilang=en",
"https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175875&apilang=en",
"https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175841&apilang=en",
"https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175730&apilang=en"
)), row.names = c(NA, -4L), class = c("data.table", "data.frame"))
> df
id url
1: P173165 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P173165&apilang=en
2: P175875 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175875&apilang=en
3: P175841 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175841&apilang=en
4: P175730 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175730&apilang=en
我想写一段代码,使这个过程自动化,并创建一个包含父项目ID的变量。这就是我想要实现的目标:
id url par_proj_id
1: P173165 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P173165&apilang=en P147665
2: P175875 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175875&apilang=en P173883
3: P175841 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175841&apilang=en P170267
4: P175730 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175730&apilang=en P173799
我想我应该在这里使用for循环,但我不确定。有什么想法吗?非常感谢您的帮助。这很简单,但我会选择async,这样您就不必等待每一个了
身份证
[1] "P147665"
id url par_proj_id
1: P173165 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P173165&apilang=en P147665
2: P175875 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175875&apilang=en P173883
3: P175841 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175841&apilang=en P170267
4: P175730 https://search.worldbank.org/api/v2/projects?format=json&fl=*&id=P175730&apilang=en P173799