Regex: How to parse specific elements in a table with Scrapy - Fatal编程技术网


regex python-3.x web-scraping scrapy


I am trying to parse certain content out of a table like the following:

<table class="dataTbl col-4">
    <tr>
        <th scope="row">Rent</th>
        <td>5.5</td>
        <th scope="row">Management</th>
        <td>3.3</td>
    </tr>
    <tr>
        <th scope="row">Deposit</th>
        <td>No</td>
        <th scope="row">Other</th>
        <td>No</td>
    </tr>
    <tr>
        <th scope="row">Other2</th>
        <td>No</td>
        <th scope="row">Insurance</th>
        <td>Yes</td>
    </tr>
</table>



My goal is to find a specific row header (e.g. Rent) and, if there is a match, extract the content of the next <td> tag (e.g. 5.5).

But how can I do this in Python?

I am using Python 3 / Scrapy 1.3.0.

In [9]: Selector(text=html).xpath('//th[text()="Rent"]/following-sibling::td[1]/text()').extract()
Out[9]: ['5.5']

Use text()="Rent" to identify the th tag.

Use following-sibling:: to get its following siblings, and use [1] to keep only the first td among them.

Alternatively, use Python's regular expressions (replace text with the header label, e.g. Rent):

r'\>text\<.+\n +\<td\>(\d+\.\d+)'
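As a rough illustration of how this pattern could be applied, the sketch below substitutes the header label for the `text` placeholder and runs it with `re.search`. The helper name `numeric_value_for` and the sample string are assumptions made for this example. Note that the pattern depends on the exact layout (a newline and spaces between `</th>` and `<td>`) and only captures decimal numbers, so a value such as "No" does not match.

```python
import re

# Sample fragment laid out like the table in the question (assumed data).
HTML = '''<tr>
    <th scope="row">Rent</th>
    <td>5.5</td>
    <th scope="row">Management</th>
    <td>3.3</td>
</tr>
<tr>
    <th scope="row">Deposit</th>
    <td>No</td>
</tr>'''


def numeric_value_for(html, label):
    """Return the decimal value in the <td> on the line after the given header.

    Returns None if the header is missing or its value is not a decimal number.
    `numeric_value_for` is a hypothetical helper name.
    """
    # re.escape guards against labels that contain regex metacharacters.
    pattern = r'\>' + re.escape(label) + r'\<.+\n +\<td\>(\d+\.\d+)'
    match = re.search(pattern, html)
    return match.group(1) if match else None


rent = numeric_value_for(HTML, "Rent")
management = numeric_value_for(HTML, "Management")
deposit = numeric_value_for(HTML, "Deposit")  # "No" is not a decimal number
```

Because the regex is tied to whitespace and line breaks in the markup, the XPath approach is usually the more robust choice when the page layout may change.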

You are my hero today :) Thanks! By the way, if you know a good source for learning the techniques above, please let me know.



