Python 3.x 重复行_Python 3.x_Xpath_Web Scraping_Scrapy - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/python-3.x/18.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 3.x 重复行_Python 3.x_Xpath_Web Scraping_Scrapy - Fatal编程技术网

Python 3.x 重复行

python-3.x xpath web-scraping scrapy

Python 3.x 重复行,python-3.x,xpath,web-scraping,scrapy,Python 3.x,Xpath,Web Scraping,Scrapy,我正在努力浏览这个网站。特别是，我正在努力浏览网站上的3个表格我和你一起解决了这个问题 tables = response.xpath('//*[@class="table table-stripefd"]') 然后我想得到表格的每一行，我用 rows = tables.xpath('//tr') 这里的问题是，在抓取并打印出一些数据之后，我注意到有些行有多个条目。例如，“Tahko vuorijuoksu”事件曾出现在网站上，但在我收集的数据中，我有3个实例有人

我正在努力浏览这个网站。特别是，我正在努力浏览网站上的3个表格

我和你一起解决了这个问题

tables = response.xpath('//*[@class="table table-stripefd"]')

然后我想得到表格的每一行，我用

rows = tables.xpath('//tr')

这里的问题是，在抓取并打印出一些数据之后，我注意到有些行有多个条目。例如，“Tahko vuorijuoksu”事件曾出现在网站上，但在我收集的数据中，我有3个实例

有人能指出发生这种情况的原因吗？

当您像这样使用选择器时：

rows = tables.xpath('//tr')

rows = tables.xpath('.//tr') # notice the .

for table in tables:
    rows = table.xpath('tr')

它将选择其自身或子代轴中的每个

tr

元素，不受父元素的限制。因此，对于3个

表元素中的每一个，它将返回所有207个tr
元素
要仅获取每个表的tr
元素child，您可以这样使用它：
rows = tables.xpath('//tr')

rows = tables.xpath('.//tr') # notice the .

for table in tables:
    rows = table.xpath('tr')

通常，这样写更直观：
rows = tables.xpath('//tr')

rows = tables.xpath('.//tr') # notice the .

for table in tables:
    rows = table.xpath('tr')

不过，这只是一个建议，前面的解决方案效果很好。
啊，我想这将是一个非常简单的错误。也许有一天我能真正理解xpath




[xpath]相关文章推荐



                                                        
如何选择节点的第一个子名称？XPath
xpath 
Xpath 如何使用xquery循环xml结构？
xpathxquery 
如何在XPath中查找特定类型的所有节点
xpath 
维基百科摘要上的XPath
xpath 
如何获取a<；中的值；td>；带有xpath/htmlwebunit的标记
xpath 
Selenium IE支持使用XPATH访问定位器
xpathselenium 
Xpath <；td>；未使用HtmlUnit返回值
xpath 
如何在xpath中处理文本节点中的括号？
xpath 
在Groovy中使用Xpath删除XML中的节点
xpathgroovy 
TIBCO BusinessWorks XPath联合运算符
xpath 
动态的xpath实现<；a>；或<；ul>；或<；李>；标签
xpath 
使用XPath查找内部文本的精确匹配
xpath 
使用算术运算符的Xpath 1.0
xpath 
带有Xpath表达式的iReport子报表
xpathjasper-reports 
Xpath 如何选择表中的行并选择其旁边的特定单选按钮
xpathselenium-webdriver 
Xpath：规范化最终节点上的空间
xpath 
用xpath表达式提取xml内容
xpath 
如何通过xpath排除某些关键字和ID
xpath 
XPath：在两个相似的标记之间匹配文本
xpathweb-scrapingscrapy 
xpath表示带货币的价格

$3.95
$13.95
xpath 
                                       





随机文章推荐



                                                        
Mod rewrite 通过RewriteCond-f检查激活站点维护？
mod-rewrite 
Mod rewrite mod_rewrite>/user/tag1/tag2/tag..？第页=1
mod-rewrite 
Mod rewrite mod_重写规则不起作用
mod-rewrite 
Mod rewrite htaccess重写
mod-rewrite 
Mod rewrite mod#u rewrite+&引用；第/123页/部分文字“-&燃气轮机&引用；showpage.php？索引=123&；标题=一些文本“；
mod-rewrite 
Mod rewrite 改写规则混乱
mod-rewrite 
Mod rewrite mod_重写为404处理程序
mod-rewrite 
Mod rewrite 搜索引擎优化友好的URL问题
mod-rewriteurl-rewritingseo 
Mod rewrite http重写条件和重写规则
mod-rewrite 
Mod rewrite .htaccess$\u GET['；p'；]不工作
mod-rewrite 
Mod rewrite 重写规则不'；I don’我没想到会这样
mod-rewrite 
Mod rewrite 子文件夹中子文件夹的Mod rewrite
mod-rewrite 
Mod rewrite 混淆Apache以充当代理服务器
mod-rewriteproxy 
Mod rewrite mod_重写协助
mod-rewrite 
Mod rewrite 重写规则在新服务器上的行为不同
mod-rewrite 
Mod rewrite mod_重写模式以处理带有参数的页面
mod-rewrite 
Mod rewrite mod_重写如何将root传输到特定文件
mod-rewrite 
Mod rewrite mod rewrite如何添加具有不同组合的多个规则
mod-rewrite 
Mod rewrite 使用GET变量创建mod_重写规则
mod-rewrite


                                        

                                        
                                        


                                                
                                                        [python 3.x]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Parse Platform
Deployment
Qt4
Ssl
Graphql
Security
Dynamics Crm
Computer Vision
C++
Sublimetext2
Streaming
Swift2
Neo4j
Autodesk Forge
Yaml
Apache Spark
Phpstorm
Matplotlib
Listview
Swagger
Stripe Payments
Bison
Xquery
String
Redis
Microservices
Time Complexity
Apache2
Libgdx
Report
Django Models
Arangodb
Orm
Windows Installer
Unity3d
Azure Cosmosdb
File Io
Keycloak
Stata
Openshift
Scheme
Tcp
Jaxb
Apache Pig
Ada
Hyperlink
Date
Uitableview
Frameworks
Prestashop
Msbuild
Hazelcast
Jdbc
Xna
Google Colaboratory
Error Handling
Shopify
Gnuplot
Artifactory
Cocoa Touch
Identityserver4
Tomcat
Asp.net Core
Xcode4
Xml
Numpy
Mono
Discord
Rspec
Windows Store Apps
Url Rewriting
Tfs
Oauth 2.0
Cakephp
Oracle Apex
Atom Editor
Google Sheets
Aws Lambda
Excel
Linux
Asp.net Mvc 4
Api
Twilio
Phpmyadmin
Jmeter
Video
Swing
Data Structures
Wolfram Mathematica
Ember.js
Llvm
Anaconda
Apache Camel
Jquery Mobile
Google Plus
Twitter
Discord.js
Angular6
Nosql
Jms
Xpages
Subsonic
Objective C
Redirect
Jestjs
Gremlin
Architecture
Asp.net
Influxdb
Text
Axapta
Apache
Ibm Mobilefirst
Import
Transactions
Xamarin.ios
Xamarin.android
Fluent Nhibernate
Windows Phone 8
Cuda
Pascal
Seo
Camera
Replace
Compilation
Menu
Datatables
Domain Driven Design
Ethereum
Doctrine
Serial Port
Macos
Mule
Grep
Unix
Migration
Xslt
Npm
Sip
Linq To Sql
Doctrine Orm
Drupal 7
Apache Flink
D
Primefaces
Wpf
Omnet++
Rally
Server
Yii2
Windows Phone 7
Notepad++
Sencha Touch
Three.js
Azure Devops
Canvas
Cluster Computing
Statistics
Network Programming
Web Services
Webpack
Cobol
Sed
Python 2.7
Select
Dask
Microsoft Graph Api
Mpi
Process
Hive
Ios
Cloud
Floating Point
Fortran
Apache Kafka
Clearcase
Utf 8
Cmake
Dictionary
Triggers
Google Maps Api 3
Isabelle
Silverlight 4.0
Safari
Jetty
Plone
Zsh
System Verilog
Actionscript 3
Gtk
Firebase
E Commerce
Geolocation
Sql Server 2005
Sharepoint 2010
Single Sign On
Certificate
Cmd
Akka
Compiler Errors
Google Drive Api


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网