Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/361.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 带lxml.html的Webscaping Scopus_Python_Web Scraping_Python Requests_Lxml.html_Scopus - Fatal编程技术网

Python 带lxml.html的Webscaping Scopus

Python 带lxml.html的Webscaping Scopus,python,web-scraping,python-requests,lxml.html,scopus,Python,Web Scraping,Python Requests,Lxml.html,Scopus,我正试图用lxml.html(最终创建一个文档标题列表)对Scopus进行webscrape,但似乎没有从page.content存储任何数据;结果列表(tr_元素)最终为空 import requests import lxml.html as lh url = 'https://www.scopus.com/results/citedbyresults.uri?sort=plf-f&cite=2-s2.0-84939544008&src=s&nlo=&nlr

我正试图用lxml.html(最终创建一个文档标题列表)对Scopus进行webscrape,但似乎没有从page.content存储任何数据;结果列表(tr_元素)最终为空

import requests
import lxml.html as lh

url = 'https://www.scopus.com/results/citedbyresults.uri?sort=plf-f&cite=2-s2.0-84939544008&src=s&nlo=&nlr=&nls=&imp=t&sid=fdbfeac69ab848bdff16425dc6937ffc&sot=cite&sdt=a&sl=0&origin=resultslist&offset=1&txGid=b63ddae0b71deb5a4615640f49db9904'
page = requests.get(url)
doc = lh.fromstring(page.content)
tr_elements = doc.xpath('//tr')

由于inspect元素显示行具有不同的类(),因此我也尝试使用
tr_elements=doc.xpath(“//tr[contains(@class,'searchArea')]”)运行它,指定要分析的行,但这也会在空列表中结束。有什么想法吗?

我想出来了。拒绝访问|使用Cloudflare限制访问