Escaping the dollar sign in Python Scrapy XPath - Fatal编程技术网


python regex xpath scrapy


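I'm using a Scrapy spider and it is pulling the wrong price output.

HTML:

Result:

'price': [u'\u20ac300']

It seems that the "$" in the price is causing the problem. I have been digging and can't seem to find an answer. I would expect this to be a common problem, which makes me think there may be something more that I'm missing.

Any help is greatly appreciated.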

Use re instead of extract:

item['price'] = sel.xpath('.../span[1]/text()').re(r'\d+')
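What the regex step does to the extracted text can be sketched with the standard re module alone. This is only an illustration, assuming the selector yields the single string shown in the question's result; the variable names are hypothetical and no Scrapy objects are involved.

```python
import re

# Assumed value: the text the XPath yields, taken from the question's result.
extracted = [u'\u20ac300']

# Apply the pattern to every extracted string and collect all matches,
# so only the digit runs survive and the currency symbol is dropped.
digits = [m for text in extracted for m in re.findall(r'\d+', text)]

print(digits)  # ['300']
```

Note that \d+ would split a value such as 1,300.50 into several matches, so a wider pattern may be needed for prices with separators.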

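Casimir et Hippolyte is right: the correct result is retrieved, it just looks different in its Python representation. Apart from that, though, your XPath expression is not ideal.
Try not to rely on long positional XPath expressions; they break easily when the HTML document changes even slightly.
Instead, try to find elements by their attributes. Perhaps this combination of class values is unique? For example,

//span[@class = 'b-product_price-standard b-product_price-standard--line_through']

might work. If not, you will have to show more of the HTML document you are selecting from.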
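The difference between the two selection styles can be sketched with the standard library's xml.etree.ElementTree, which supports a small XPath subset. The markup below is hypothetical (the question's HTML is not shown), and the extra class names b-product_price, b-product_price-sales and b-product_badge are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup; only the first span's class comes from the answer.
SNIPPET = """
<div class="b-product_price">
  <span class="b-product_price-standard b-product_price-standard--line_through">\u20ac300</span>
  <span class="b-product_price-sales">\u20ac250</span>
</div>
"""

CLASS_PATH = ".//span[@class='b-product_price-standard b-product_price-standard--line_through']"
POSITION_PATH = "./span[1]"

root = ET.fromstring(SNIPPET)
by_class_before = [s.text for s in root.findall(CLASS_PATH)]
by_position_before = [s.text for s in root.findall(POSITION_PATH)]

# Simulate a small page change: a new span appears in front of the price.
badge = ET.Element("span", {"class": "b-product_badge"})
badge.text = "Sale"
root.insert(0, badge)

by_class_after = [s.text for s in root.findall(CLASS_PATH)]
by_position_after = [s.text for s in root.findall(POSITION_PATH)]

# The positional path now selects the badge; the class-based path is unaffected.
print(by_position_after == by_position_before)  # False
print(by_class_after == by_class_before)        # True
```

In a spider the same class-based expression would be passed to the selector's xpath() method instead; ElementTree is used here only to keep the sketch free of third-party dependencies.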
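The result is correct (and it is a euro sign): this is just a way of representing characters outside the ASCII range by their Unicode code points. Try print(u'\u20ac300'). See this link:
@casimirithippolyte Thanks! I hadn't even thought of that.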
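The point made in the comment can be checked without a spider. The sketch below assumes Python 3, where ascii() produces the escaped form that the Python 2 list output in the question displays (minus the u prefix); the variable name is hypothetical.

```python
import unicodedata

# The value from the question's result.
price = [u'\u20ac300']

# Escaped representation of the list, similar to what the question shows.
print(ascii(price))                    # ['\u20ac300']

# The escape stands for exactly one character, the euro sign.
print(unicodedata.name(price[0][0]))   # EURO SIGN
print(len(price[0]))                   # 4
```

So the string holds four characters, a euro sign followed by 300; only the way the containing list is displayed makes it look wrong.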
