Reading JSON from the web with Scrapy in Python 2 - Python, Json, Python 2.7, Web Scraping, Scrapy - Fatal编程技术网


Reading JSON from the web with Scrapy in Python 2

python json python-2.7 web-scraping scrapy


I want to extract JSON data from a web page, so I inspected the page. The data I need is stored in the following format:

<script type="application/ld+json">
{
    'data I want to extract'
}
</script>

But it doesn't work. How should I change it?

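You need to find the script element in the HTML source, extract its text, and then load it with json.loads().

The script can be located by its not-so-common application/ld+json type, but there are many other options, such as matching on some text inside the script itself:

//script[contains(., 'Restaurant')]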
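
The steps described above (locate the script element, take its text, pass it to json.loads()) can be sketched roughly as follows. This is a minimal illustration, not the answer's original code: the helper name extract_ld_json and the default XPath that matches on the type attribute are assumptions, and the function expects a Scrapy response (or any object with a compatible xpath() method).

```python
import json

# Assumed XPath: select the text of the script tagged as JSON-LD.
LD_JSON_XPATH = '//script[@type="application/ld+json"]/text()'


def extract_ld_json(response, xpath=LD_JSON_XPATH):
    """Return the parsed JSON from the first matching script, or None.

    `response` is expected to be a Scrapy response; `xpath` can be swapped
    for another locator, e.g. "//script[contains(., 'Restaurant')]/text()".
    """
    # extract_first() returns None when nothing matches.
    raw = response.xpath(xpath).extract_first()
    if raw is None:
        return None
    # json.loads() tolerates surrounding whitespace but needs valid JSON
    # (double-quoted strings), otherwise it raises ValueError.
    return json.loads(raw)
```

Inside a spider callback this would be called as data = extract_ld_json(response). If the page contains several such scripts, only the first match is parsed here.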











                                                
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网