Python 2.7 Python Scrapy单击html按钮_Python 2.7_Web Scraping_Scrapy - Fatal编程技术网

Python 2.7 Python Scrapy单击html按钮

python-2.7 web-scraping scrapy

Python 2.7 Python Scrapy单击html按钮,python-2.7,web-scraping,scrapy,Python 2.7,Web Scraping,Scrapy,我不熟悉scrapy，并将scrapy与Python2.7一起用于web自动化。我想在网站上点击一个html按钮，打开一个登录表单。我的问题是，我只想点击一个按钮，然后将控件转移到新页面。我读过所有类似的问题，但没有一个令人满意，因为它们都包含直接登录或使用selenium 下面是按钮的HTML代码，我想访问http://example.com/login其中有登录页面 <div class="pull-left"> <a href="http://example.co

我不熟悉scrapy，并将scrapy与Python2.7一起用于web自动化。我想在网站上点击一个html按钮，打开一个登录表单。我的问题是，我只想点击一个按钮，然后将控件转移到新页面。我读过所有类似的问题，但没有一个令人满意，因为它们都包含直接登录或使用selenium

下面是按钮的HTML代码，我想访问http://example.com/login其中有登录页面

<div class="pull-left">
    <a href="http://example.com/login" class="emplink">Employers</a>

我是否需要在每次访问链接时使用“屈服”和回调到新功能，或者有其他方法可以做到这一点

您需要的是产生一个新的请求或更轻松地做出

响应。请按照以下步骤操作：
关于回调，它基本上取决于页面被解析的容易程度，例如，检查文档上的部分，您需要的是生成一个新的请求或更容易做出一个响应
关于回调，它基本上取决于页面被解析的容易程度，例如，检查文档上的部分
import scrapy

class QuotesSpider(scrapy.Spider):
    name = 'pro'
    url =  "http://login-page.com/"


def start_requests(self):
    yield scrapy.Request(self.url, self.parse_login)


def parse_login(self, response):
    employers = response.css("div.pull-left a::attr(href)").extract_first()
    print employers

def parse_login(self, response):
    next_page = response.css("div.pull-left a::attr(href)").extract_first()
    if next_page is not None:
        yield response.follow(next_page, callback=self.next_page_parse)




[web scraping]相关文章推荐



                                                        
Web scraping 谷歌代码搜索
web-scrapingweb-crawler 
Web scraping 使用HtmlUnit获取页面源：URL被卡住
web-scrapingmonitoring 
Web scraping Q:scrapy redis没有'；I don’我一页也不刮，一秒钟就写完了
web-scrapingscrapy 
Web scraping 在此服务器上找不到请求的URL/\u Incapsula\u资源
web-scraping 
Web scraping 在用谷歌表单抓取Instagram粉丝时被阻止
web-scrapinggoogle-sheetsinstagram 
Web scraping PowerBI-Web抓取超过一百万页
web-scrapingpowerbi 
Web scraping Python请求无法在trip advisor上获取源代码
web-scraping 
Web scraping 请求-无法建立新连接：[Errno 8]提供了节点名或服务名，或未知'；
web-scraping 
Web scraping 自动将数据从近乎非结构化的html页面刮到google工作表
web-scrapinggoogle-sheets 
Web scraping 你能在期货市场上运行scrapy吗？
web-scrapingscrapy 
Web scraping 美丽的狼，我想把文字刮到它真正的形状
web-scraping 
Web scraping 解析的页面错误吗？
web-scraping 
Web scraping 使用Wget从目录网站下载.epub文件
web-scraping 
Web scraping Apify-如何使用“动态”命令刮取多个页面（请求队列）；“下一页”；按钮
web-scraping 
Web scraping 网页抓取设计-最佳实践
web-scrapingweb-crawlerworkflow 
                                       





随机文章推荐



                                                        
Riak-字段上的MapReduce排序
mapreduce 
Mapreduce 如何使用map reduce级联跟踪大量统计数据？
mapreducestatistics 
MapReduce是否适用于数据聚合？
mapreduce 
Hadoop 2-使用PIG over Hadoop解决MapReduce问题
mapreducecassandraapache-pig 
Mapreduce 在PIG存储中实现动态定位-正确方式
mapreduceapache-pig 
Mapreduce Couchbase视图\减少给定密钥的计数
mapreducecouchbase 
Mapreduce 无法避免在Apache Pig中重复删除
mapreduceapache-pig 
couchdb mapreduce查询多个键的交集
mapreducecouchdb


                                        

                                        
                                        


                                                
                                                        [python 2.7]相关推荐
                                                        
Python 2.7 Python-读取系统信息（CPU内核使用率和温度）
									Python 2.7
							 
Python 2.7 “机械化控制名称”；无”；
									Python 2.7
							 
Python 2.7 这个元类实现有什么问题？
									Python 2.7
							 
Python 2.7 Python：将浮点值转换为4 uint8
									Python 2.7
							 
Python 2.7 Selenium webdriver不了解网页已更改，显示的元素数不正确
									Python 2.7
							 									Selenium Webdriver
							 
Python 2.7 是否可以使用google oauth2检索刷新令牌
									Python 2.7
							 									Oauth
							 
Python 2.7 python 6错误配置？
									Python 2.7
							 
Python 2.7 Python银行帐户类错误
									Python 2.7
							 
Python 2.7 名称错误：名称'；alg&x27；没有定义
									Python 2.7
							 
Python 2.7 在Selenium Firefox概要文件-Python中禁用Ghostery插件介绍页面
									Python 2.7
							 									Selenium Webdriver
							 
Python 2.7 Python无法识别我编辑的BOTO文件
									Python 2.7
							 									Google Cloud Storage
							 
Python 2.7 Python 2.7：如何跟踪不断下降的RAM？
									Python 2.7
							 									Qt
							 									Pandas
							 									Memory
							 
Python 2.7 anaconda ImportError:pip卸载后没有名为numpy的模块
									Python 2.7
							 									Numpy
							 									Anaconda
							 
Python 2.7 如何将wav文件集转换为.sf2格式
									Python 2.7
							 
Python 2.7 Python：尝试循环遍历字符串以查找匹配的字符
									Python 2.7
							 
Python 2.7 如何将变量作为列名传递
									Python 2.7
							 									Pandas
							 
Python 2.7 如何加速两个超过10万项的DICT之间的比较
									Python 2.7
							 									Csv
							 									Dictionary
							 
Python 2.7 在python中导入模块的时间比预期的长
									Python 2.7
							 
Python 2.7 在Python2中打印二维混合数组（包含整数和浮点数据类型）中的一行
									Python 2.7
							 									Numpy
							 
Python 2.7 如何使用python中的while循环返回文件中大于某个截止值的前x个字？
									Python 2.7
							 
Python 2.7 Python2.7使用scipy拟合（最小化）函数时内存泄漏
									Python 2.7
							 									Numpy
							 									Optimization
							 									Memory Leaks
							 
Python 2.7 URL/testform处理程序与任何处理程序都不匹配
									Python 2.7
							 									Google App Engine
							 
Python 2.7 保存一个选择并在psychopy上显示另一个选择
									Python 2.7
							 
Python 2.7 如何使用BioPython获取PubMed出版物的日期
									Python 2.7
							 
Python 2.7 使用Python从PDF读取特殊字符和字体
									Python 2.7
							 
Python 2.7 如何将文件转换为\x41
									Python 2.7
							 
Python 2.7 Python:ZeroMQ是否可以设置一个回调，以便在收到消息后调用？
									Python 2.7
							 
Python 2.7 Apache Beam Hello World示例上下文版本冲突
									Python 2.7
							 									Google Colaboratory
							 
Python 2.7 皮查姆：鞋底测试不是测试，但鞋底测试是
									Python 2.7
							 									Pycharm
							 
Python 2.7 Fernet密钥的Python加密和解密错误必须是32 url安全的base64编码字节
									Python 2.7
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Here Api
Xslt
Cloud Foundry
Blackberry
Raspberry Pi
Verilog
Sms
Excel Formula
Parse Platform
Iis
Entity Framework 4
Couchdb
Google Cloud Firestore
Automated Tests
Install4j
Browser
Sencha Touch
Cors
Msbuild
Socket.io
Docker Compose
Centos
Ruby On Rails 3.1
Cypress
Telerik
Memory Management
Hbase
Apache2
Ssl
Linkedin
Heroku
Dynamics Crm 2011
Vim
Google App Engine
Python Sphinx
Mapbox
Compression
Google Bigquery
Pascal
Shopify
Twitter Bootstrap
Colors
Apache Storm
Single Sign On
Http
3d
Meteor
Xsd
Sublimetext2
Pentaho
Google Plus
Sql Server 2008 R2
Build
Virtual Machine
Grid
Xmpp
Sml
Apache Flink
Encoding
Snmp
Antlr4
Terminal
Joomla
Autocomplete
Ionic2
Layout
Silverlight 4.0
Synchronization
Dataframe
Asp.net Web Api
Sugarcrm
Sharepoint
Azure Service Fabric
Jasper Reports
Plsql
Function
Vuejs2
Sapui5
R
Lucene
Responsive Design
Interface
Neural Network
Asp.net
Spotify
Google Visualization
Angularjs
Domain Driven Design
C#
Editor
Xamarin.android
Bootstrap 4
Excel
Youtube
Jqgrid
Jmeter
Charts
Abap
Visual Studio Code
Mdx
Jwt
Sharepoint 2010
Hazelcast
Sql Server
Listview
Crystal Reports
Asterisk
Visual Studio 2010
Antlr
Javascript
Jhipster
Functional Programming
Wix
Mvvm
Modelica
Java Me
Xamarin.forms
Python 3.x
Jdbc
Mono
Jaxb
Windows
Download
Jquery Plugins
Servlets
Bison
Gwt
Content Management System
Post
Clang
Oracle10g
Asp.net Mvc 2
Machine Learning
Cucumber
Inheritance
Microsoft Graph Api
Architecture
Wicket
Playframework 2.0
Elm
Windows 7
Ag Grid
Winforms
Computer Science
Error Handling
Amazon Redshift
Orchardcms
Android Ndk
Dependency Injection
Language Agnostic
Sip
Notifications
Select
Imagemagick
Shiny
Xampp
Discord
Discord.py
Oracle11g
Google Sheets
Anaconda
Embedded
Oauth
Mapreduce
Numpy
Parallel Processing
Ubuntu
Windows Phone 7
Swiftui
Asp.net Mvc 3
Yii2
Vmware
Tomcat
Qt4
Azure Ad B2c
Signalr
Laravel
Llvm
Merge
Asynchronous
Qml
Zend Framework
Jersey
Jquery Mobile
Aws Lambda
Wordpress
Windows Installer
Jquery
Geolocation
Frameworks
Oauth 2.0
Sprite Kit
Grafana
Batch File
Filesystems
Docker
Ssrs 2008
Visual Studio 2008
Model View Controller
Oracle Apex
Hybris


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网