Python 使用xpath和scrapy从HTML中提取特定值_Python_Html_Xpath_Scrapy

Python 使用xpath和scrapy从HTML中提取特定值

python html xpath scrapy

Python 使用xpath和scrapy从HTML中提取特定值,python,html,xpath,scrapy,Python,Html,Xpath,Scrapy,我有以下html代码： 19:00 怎么了 response.xpath'/a/@target'.extract 为什么？如果格式化HTML，错误是显而易见的您希望从标记中提取文本，而不是从目标属性中提取文本 scrapy的xpath使用了这个选择器，你有这样一个bug的例子吗？请看一看：我明白了，也许它可以使用正确的内容类型 <tr data-live="COumykPG" data-dt="10,11,2017,19,00" data-def="1">

我有以下html代码：

19:00 怎么了 response.xpath'/a/@target'.extract

为什么？如果格式化HTML，错误是显而易见的

您希望从标记中提取文本，而不是从目标属性中提取文本

scrapy的xpath使用了这个选择器，你有这样一个bug的例子吗？请看一看：我明白了，也许它可以使用正确的内容类型

        <tr data-live="COumykPG" data-dt="10,11,2017,19,00" data-def="1">
           <td class="table-matches__tt">
              <span class="table-matches__time" data-live-cell="time">19:00</span>
              <a href="/soccer/germany/oberliga-bremen/oberneuland-habenhauser/COumykPG/" data-live-cell="matchlink">
              <span>Oberneuland</span> - <span>Habenhauser</span>
              </a>
           </td>
           <td class="livebet" data-live-cell="livebet">&nbsp;</td>
           <td class="table-matches__streams" data-live-cell="score"></td>
           <td class="table-matches__odds" data-oid="2p2k5xv464x0x6ev9v">

           <a href="/myselections.php?action=3&amp;matchid=COumykPG&amp;outcomeid=2p2k5xv464x0x6ev9v&amp;otheroutcomes=2p2k5xv498x0x0,2p2k5xv464x0x6eva0" 
              onclick="return my_selections_click('1x2', 'soccer');" 
              title="Add to My Selections" 
              target="mySelections">1.10</a>

           </td>
           <td class="table-matches__odds" data-oid="2p2k5xv498x0x0">

           <a href="/myselections.php?action=3&amp;matchid=COumykPG&amp;outcomeid=2p2k5xv498x0x0&amp;otheroutcomes=2p2k5xv464x0x6ev9v,2p2k5xv464x0x6eva0" 
              onclick="return my_selections_click('1x2', 'soccer');" 
              title="Add to My Selections" 
              target="mySelections">7.44</a>

           </td>
           <td class="table-matches__odds" data-oid="2p2k5xv464x0x6eva0">

           <a href="/myselections.php?action=3&amp;matchid=COumykPG&amp;outcomeid=2p2k5xv464x0x6eva0&amp;otheroutcomes=2p2k5xv464x0x6ev9v,2p2k5xv498x0x0" 
              onclick="return my_selections_click('1x2', 'soccer');" 
              title="Add to My Selections" 
              target="mySelections">12.40</a>

           </td>
        </tr>

from scrapy.selector import Selector
result_array = Selector(text=response.body).xpath('//a/text()').extract()