Trouble converting requests.models.Response to scrapy.selector.unified.Selector


This code

import requests
url = 'https://docs.scrapy.org/en/latest/_static/selectors-sample1.html'
response = requests.get(url)

returns a requests.models.Response instance, from which I can extract data with Scrapy:

from scrapy import Selector
sel = Selector(response=response)
sel.xpath('//div')

This is how I access the website; it is only part of the code:

response = requests.get('https://www.zhihu.com/api/v4/columns/wangzhenotes/items', headers=headers)
print(response.json())

This way, I do get the content from that site.

However, the same Selector code fails to extract data from this response instance:

sel = Selector(response=response)
len(sel.xpath('//div'))

I just get 0. How can I fix this?

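The result of this request

response = requests.get('https://www.zhihu.com/api/v4/columns/wangzhenotes/items', headers=headers)

is a JSON object, so it certainly does not contain any div.

To get the information you need, you have to parse that JSON: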

response = requests.get('https://www.zhihu.com/api/v4/columns/wangzhenotes/items', headers=headers)
data = response.json()['data']

Then loop over the data list and pick out the fields you need.
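The loop described above could look roughly like the sketch below. The payload is a made-up stand-in for response.json(), and the field names title and url are assumptions for illustration; the real API may use different keys.

```python
# Stand-in for response.json(); the real payload's fields may differ.
payload = {
    "data": [
        {"title": "First post", "url": "https://example.com/1"},
        {"title": "Second post", "url": "https://example.com/2"},
        {"url": "https://example.com/3"},  # entry with no title
    ]
}


def pick_fields(items, wanted=("title", "url")):
    """Return one dict per item, keeping only the wanted keys (None if missing)."""
    return [{key: item.get(key) for key in wanted} for item in items]


rows = pick_fields(payload["data"])
for row in rows:
    print(row["title"], row["url"])
```

Using item.get(key) rather than item[key] keeps the loop from raising KeyError when an entry lacks one of the fields.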
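Likewise, if you want to use Scrapy, you can request the URL https://www.zhihu.com/api/v4/columns/wangzhenotes/items

and then read the response as JSON in the parse method:

j_obj = json.loads(response.text)  # response.body_as_unicode() in older Scrapy versions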
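As a minimal sketch of that callback, the function below mirrors what a spider's parse(self, response) might do. It is written as a plain function that only touches response.text, so it runs without Scrapy installed. The fake response object, the payload, and the title and url keys are all assumptions for illustration.

```python
import json
from types import SimpleNamespace


def parse(response):
    """Yield one item per entry of the JSON body's "data" list.

    In a spider this would be `def parse(self, response)`; only
    `response.text` is used, so any object with that attribute works.
    """
    j_obj = json.loads(response.text)
    for entry in j_obj.get("data", []):
        yield {"title": entry.get("title"), "url": entry.get("url")}


# Stand-in for a Scrapy response, with a made-up payload.
fake_response = SimpleNamespace(
    text='{"data": [{"title": "Hello", "url": "https://example.com/1"}]}'
)
items = list(parse(fake_response))
```

Yielding plain dicts is how a Scrapy callback hands items to the rest of the pipeline, which is why the sketch is a generator rather than a function returning a list.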
In sel = Selector(response=response), what is response here? Is it the response to this request: response = requests.get('https://www.zhihu.com/api/v4/columns/wangzhenotes/items', headers=headers)?
@Roman Yes, it is.
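Since the confusion in this thread came from running XPath over a JSON body, one option is to check what kind of body came back before choosing a parser. The helper below is a hypothetical sketch using only the standard library; classify_body is not part of requests or Scrapy.

```python
import json


def classify_body(text):
    """Return ("json", parsed_object) for a JSON object/array, else ("text", text)."""
    try:
        obj = json.loads(text)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return "text", text
    if isinstance(obj, (dict, list)):
        return "json", obj
    return "text", text


json_kind, parsed = classify_body('{"data": []}')
html_kind, _ = classify_body("<div>hello</div>")
```

With a requests response, this would be called as classify_body(response.text). A "json" result means the data should be read from the parsed object, while a "text" result is where a Selector and XPath may apply.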



