Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/r/75.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Xml 从R中的网页返回链接列表_Xml_R_Web Scraping - Fatal编程技术网

Xml 从R中的网页返回链接列表

Xml 从R中的网页返回链接列表,xml,r,web-scraping,Xml,R,Web Scraping,我试图在r中编写一个函数,给定一个地址,它将返回该网页上的链接列表 例如: getLinks("http://prog21.dadgum.com/109.html") 将返回: "http://prog21.dadgum.com/prog21.css" "http://prog21.dadgum.com/atom.xml" "http://prog21.dadgum.com/index.html" "http://prog21.dadgum.com/archives.html" "http:/

我试图在r中编写一个函数,给定一个地址,它将返回该网页上的链接列表

例如:

getLinks("http://prog21.dadgum.com/109.html")
将返回:

"http://prog21.dadgum.com/prog21.css"
"http://prog21.dadgum.com/atom.xml"
"http://prog21.dadgum.com/index.html"
"http://prog21.dadgum.com/archives.html"
"http://prog21.dadgum.com/atom.xml"
"http://prog21.dadgum.com/56.html"
"http://prog21.dadgum.com/39.html"
"http://prog21.dadgum.com/109.html"
"http://prog21.dadgum.com/108.html"
"http://prog21.dadgum.com/107.html"
"http://prog21.dadgum.com/106.html"
"http://prog21.dadgum.com/105.html"
"http://prog21.dadgum.com/104.html"

此函数似乎适用于其他网页,但由于某些原因,它不会返回有关网页的完整URL。我很想知道是否有更好的方法

getLinks <- function(URL) {
    require(XML)
    doc <- htmlParse(URL)
    out <- unlist(doc['//@href'])
    names(out) <- NULL
    out
}
getLinks