Printing HTML from a page with Python and urllib2 doesn't show the whole page source

Tags: python, html, html-parsing, urllib2

I'm trying to read a specific page on Amazon:

import urllib2

req = urllib2.Request('http://www.amazon.com/Upright-Citizens-Brigade-Comedy-Improvisation/dp/0989387801/ref=lp_1_1_6/175-0367440-7496156?ie=UTF8&qid=1376827779&sr=1-6%20buybox._V181901516_.png)%20center%20top%20no-repeat;')
# Build the User-agent from adjacent string literals so that no stray
# whitespace from backslash line continuations ends up in the header value.
req.add_header('User-agent',
               'Mozilla/5.0 (Windows NT 6.2; WOW64) '
               'AppleWebKit/537.11 (KHTML, like Gecko) '
               'Chrome/23.0.1271.97 Safari/537.11')
response = urllib2.urlopen(req)
html = response.read()
print html

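I'm trying to read the price of the new item, "$25.00", which appears in the page source, but that part doesn't show up in the printed html. What am I doing wrong?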

You should use an HTML parser such as lxml. Here's an example using lxml:

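from lxml import etree

# html is the page source read in the question's code
parser = etree.HTMLParser()
root = etree.fromstring(html, parser=parser)

# Select the text of the span inside the "new" price cell
print root.xpath('//td[@class="a-text-right dp-new-col"]/a/span/text()')[0]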
This prints:

$25.00
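Note that the desired tag and its value are found using an XPath expression:

XPath, the XML Path Language, is a query language for selecting nodes from an XML document.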
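To see how the expression walks the tree without fetching anything, here is a minimal sketch that runs a similar path against a small hand-written snippet. It makes several assumptions: the `SNIPPET` markup and the `first_price` helper are invented for illustration and are not Amazon's real page, and it swaps lxml for the standard library's `xml.etree.ElementTree`, which supports only a subset of XPath and requires well-formed markup. For real-world HTML, the lxml `HTMLParser` approach above is the more robust choice.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed stand-in for the relevant part of the page.
SNIPPET = """
<table>
  <tr>
    <td class="a-text-right dp-new-col">
      <a href="#"><span>$25.00</span></a>
    </td>
  </tr>
</table>
"""


def first_price(markup):
    """Return the text of the first matching span, or None if absent."""
    root = ET.fromstring(markup.strip())
    # ".//td[...]" searches all descendants for a td whose class attribute
    # equals the given value, then steps down through a and span.
    span = root.find(".//td[@class='a-text-right dp-new-col']/a/span")
    if span is None or span.text is None:
        return None
    return span.text.strip()


print(first_price(SNIPPET))
```

Returning `None` when nothing matches avoids the `IndexError` that indexing an empty result list with `[0]` would raise if the page layout differs from what the expression expects.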


Hope that helps.

You should be able to replace your current request URL with:
http://www.amazon.com/Upright-Citizens-Brigade-Comedy-Improvisation/dp/0989387801/
and then simply parse the html to find the price.
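The shortened URL can also be derived from the long one programmatically. The following is a rough sketch under stated assumptions: the helper name `strip_tracking` is made up for this example, and it assumes the product path always contains a `/dp/<id>` segment, as in the URL from the question. It is written to run on Python 3, with a fallback import for the Python 2 `urlparse` module used alongside `urllib2`.

```python
import re

try:
    from urllib.parse import urlsplit, urlunsplit  # Python 3
except ImportError:
    from urlparse import urlsplit, urlunsplit  # Python 2


def strip_tracking(url):
    """Keep scheme, host and the path up to /dp/<id>/; drop the rest.

    Hypothetical helper: assumes the product id is the path segment
    that follows "/dp/". URLs without that segment keep their path.
    """
    parts = urlsplit(url)
    match = re.search(r'^(.*?/dp/[^/]+)', parts.path)
    path = match.group(1) + '/' if match else parts.path
    # Empty query and fragment remove the trailing parameters.
    return urlunsplit((parts.scheme, parts.netloc, path, '', ''))


long_url = ('http://www.amazon.com/Upright-Citizens-Brigade-Comedy-'
            'Improvisation/dp/0989387801/ref=lp_1_1_6/175-0367440-7496156'
            '?ie=UTF8&qid=1376827779&sr=1-6%20buybox._V181901516_.png)'
            '%20center%20top%20no-repeat;')
print(strip_tracking(long_url))
```

The result can then be passed to `urllib2.Request` in place of the original URL.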