Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/293.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 刮擦返回空数组_Python_Xpath_Css Selectors_Scrapy_Scrapy Spider - Fatal编程技术网

Python 刮擦返回空数组

Python 刮擦返回空数组,python,xpath,css-selectors,scrapy,scrapy-spider,Python,Xpath,Css Selectors,Scrapy,Scrapy Spider,我想用刮痧来爬。所以我打开firefox或chrome for xpath或css选择器来复制这个地址!我明白了!但这个回报是空的!我在在线站点(inspect元素)和response.body中使用jquery进行了尝试。它们都是一样的,但不起作用!例如,我的爬网页面是 地址如下: IN INSPECT ELEMENT: { FIREFOX CSS: html body table tbody tr td div table tbody tr td center table tbody tr

我想用刮痧来爬。所以我打开firefox或chrome for xpath或css选择器来复制这个地址!我明白了!但这个回报是空的!我在在线站点(inspect元素)和response.body中使用jquery进行了尝试。它们都是一样的,但不起作用!例如,我的爬网页面是 地址如下:

IN INSPECT ELEMENT:
{
FIREFOX CSS:
html body table tbody tr td div table tbody tr td center table tbody tr td table tbody tr td table.borderact tbody tr td h1
FIREFOX XPATH:
/html/body/table/tbody/tr/td/div/table[1]/tbody/tr/td[3]/center/table/tbody/tr/td[3]/table[1]/tbody/tr/td[2]/table/tbody/tr[1]/td/h1

CHROME CSS: 
body > table > tbody > tr > td > div > table:nth-child(3) > tbody > tr > td:nth-child(3) > center > table > tbody > tr > td:nth-child(3) > table:nth-child(1) > tbody > tr > td:nth-child(2) > table > tbody > tr:nth-child(1) > td > h1


IN RESPONSE:

FIREFOX CSS:
html body table tbody tr td div table tbody tr td center table tbody tr td table tbody tr td table.\"borderact\" tbody tr td h1
FIREFOX XPATH:
/html/body/table/tbody/tr/td/div/table[1]/tbody/tr/td[3]/center/table/tbody/tr/td[3]/table[1]/tbody/tr/td[2]/table/tbody/tr[1]/td/h1
 CHROME CSS: 
 body > table > tbody > tr > td > div > table:nth-child(3) > tbody > tr > td:nth-child(3) > center > table > tbody > tr > td:nth-child(3) > table:nth-child(1) > tbody > tr > td:nth-child(2) > table > tbody > tr:nth-child(1) > td > h1
}
通常由浏览器插入,以使HTML一致。网站作者通常省略
。默认情况下,Scrapy通过
lxml
查看“原始”HTML,并不试图使其一致。因此,请尝试删除XPath中的
tbody/
步骤。
通常由浏览器插入,以使HTML一致。网站作者通常省略
。默认情况下,Scrapy通过
lxml
查看“原始”HTML,并不试图使其一致。因此,请尝试删除XPath中的
tbody/
步骤。