Python 使用xpath和scrapy从HTML中提取特定值
我有以下html代码: 19:00 怎么了 response.xpath'/a/@target'.extract 为什么? 如果格式化HTML,错误是显而易见的 您希望从标记中提取文本,而不是从目标属性中提取文本Python 使用xpath和scrapy从HTML中提取特定值,python,html,xpath,scrapy,Python,Html,Xpath,Scrapy,我有以下html代码: 19:00 怎么了 response.xpath'/a/@target'.extract 为什么? 如果格式化HTML,错误是显而易见的 您希望从标记中提取文本,而不是从目标属性中提取文本 scrapy的xpath使用了这个选择器,你有这样一个bug的例子吗?请看一看:我明白了,也许它可以使用正确的内容类型 <tr data-live="COumykPG" data-dt="10,11,2017,19,00" data-def="1">
scrapy的xpath使用了这个选择器,你有这样一个bug的例子吗?请看一看:我明白了,也许它可以使用正确的内容类型
<tr data-live="COumykPG" data-dt="10,11,2017,19,00" data-def="1">
<td class="table-matches__tt">
<span class="table-matches__time" data-live-cell="time">19:00</span>
<a href="/soccer/germany/oberliga-bremen/oberneuland-habenhauser/COumykPG/" data-live-cell="matchlink">
<span>Oberneuland</span> - <span>Habenhauser</span>
</a>
</td>
<td class="livebet" data-live-cell="livebet"> </td>
<td class="table-matches__streams" data-live-cell="score"></td>
<td class="table-matches__odds" data-oid="2p2k5xv464x0x6ev9v">
<a href="/myselections.php?action=3&matchid=COumykPG&outcomeid=2p2k5xv464x0x6ev9v&otheroutcomes=2p2k5xv498x0x0,2p2k5xv464x0x6eva0"
onclick="return my_selections_click('1x2', 'soccer');"
title="Add to My Selections"
target="mySelections">1.10</a>
</td>
<td class="table-matches__odds" data-oid="2p2k5xv498x0x0">
<a href="/myselections.php?action=3&matchid=COumykPG&outcomeid=2p2k5xv498x0x0&otheroutcomes=2p2k5xv464x0x6ev9v,2p2k5xv464x0x6eva0"
onclick="return my_selections_click('1x2', 'soccer');"
title="Add to My Selections"
target="mySelections">7.44</a>
</td>
<td class="table-matches__odds" data-oid="2p2k5xv464x0x6eva0">
<a href="/myselections.php?action=3&matchid=COumykPG&outcomeid=2p2k5xv464x0x6eva0&otheroutcomes=2p2k5xv464x0x6ev9v,2p2k5xv498x0x0"
onclick="return my_selections_click('1x2', 'soccer');"
title="Add to My Selections"
target="mySelections">12.40</a>
</td>
</tr>
from scrapy.selector import Selector
result_array = Selector(text=response.body).xpath('//a/text()').extract()