Python 将碎片对象导出到每个项目的一个文件中_Python_Web Scraping_Scrapy - Fatal编程技术网

Python 将碎片对象导出到每个项目的一个文件中

python web-scraping scrapy

Python 将碎片对象导出到每个项目的一个文件中,python,web-scraping,scrapy,Python,Web Scraping,Scrapy,我正在使用scrapy获取一些网页的内容。有没有办法配置scrapy，使其将每个数据线导出到单独的文件中您可以在spider中生成项目，以返回要在管道中处理的多个项目 class SomeSpider(Spider): ... def parse(self, response): # some code to parse the webpage for some_line in webpage: item = YourItem()

我正在使用scrapy获取一些网页的内容。有没有办法配置scrapy，使其将每个数据线导出到单独的文件中

您可以在spider中生成项目，以返回要在管道中处理的多个项目

class SomeSpider(Spider):

  ...

  def parse(self, response):
    # some code to parse the webpage

    for some_line in webpage:
        item = YourItem()
        # parse items

        yield item

这将为一个刮取的页面返回多个项目。然后只需指定管道，将每个项目写入单独的文件

class SomePipeline(object):

  ...      

  def process_item(self, item, spider):
      with open('file.txt', 'w') as f:

          # format your item into a line here

          f.write(line)

您的意思是将每个

项目实例放入一个单独的文件中吗？@alecxe是的，我是指每个项目




[web scraping]相关文章推荐



                                                        
Web scraping 如何获取引用计数？
web-scraping 
Web scraping 设置代理以隐藏我的IP地址，以便使用scrapy抓取网页
web-scraping 
Web scraping 如何向第三方RSS阅读器提供Firebase数据？
web-scrapingrssfirebase 
Web scraping 用于登录网站和更新的简单自动化工具
web-scrapingautomation 
Web scraping 如何从html标记中提取文本以及如何过滤它包含的文本？
web-scraping 
Web scraping 即使机器人关闭，Wget也会重定向
web-scraping 
Web scraping 美丽的汤，刮掉一个拍卖网站，在拍卖完成后清理售出物品div
web-scraping 
Web scraping response.css（'；.item name:：text'；）。extract（）不提取名称
web-scrapingscrapy 
Web scraping 使用sed或其他更好的工具从网页上删除5个字符？
web-scrapingawksed 
                                       





随机文章推荐



                                                        
使用ImageMagick将图像放置在较大的画布中
imagemagick 
Imagemagick 轻量级命令行图像大小调整器？
imagemagick 
Imagemagick：创建多个边框
imagemagick 
Imagemagick 将apng转换为具有足够分辨率和颜色深度的gif
imagemagick 
ImageMagick:运行转换时出错：转换：无法读取字体
imagemagick 
Imagemagick 调整大小后的getimagesize（）
imagemagick 
ImageMagick绘制一系列具有增长辉光效果的图像
imagemagick 
使用ImageMagick展平PNG图像
imagemagick 
在Imagemagick蒙太奇上指定标签
imagemagick 
Imagemagick 如何创建颜色为3、位深度为1的PNG图像
imagemagick 
Imagemagick Python棒序列未从内存中清除
imagemagick 
使用ImageMagick和'清理OCR的图像；textcleaner&x27；
imagemagick 
Imagemagick Imagick调整大小但不'；无法保存已调整的大小
imagemagick 
Imagemagick 如何在较小的水印/帧图像上覆盖较大的水印/帧图像？
imagemagick 
使用ImageMagick相对于文本定位另一个图像
imagemagick 
Imagemagick 图像魔术-如何压缩谷歌网页的速度？
imagemagick 
ImageMagick–；沿内部不透明对象边界分割透明图像
imagemagick 
Imagemagick 哪个ImageMagic命令可以写入图像元数据？
imagemagick 
从ImageMagick中的GIF获取第一个转换的PNG文件
imagemagick 
Imagemagick magick命令出现无法打开图像错误
imagemagick


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
Python Facebook showAddSectionButton问题
									Python
							 									Facebook
							 									Google App Engine
							 
Python从3d图像中获取文本
									Python
							 
python中的While循环（新手）
									Python
							 
在Python中为对象提供属性
									Python
							 
Python 如何在pandas中合并具有重复值的两个数据帧
									Python
							 									Pandas
							 									Dataframe
							 
Python 我的许可被拒绝使用Kivy
									Python
							 
Python 在生成另一个实例之前，我可以让ST3先杀死我的程序的前一个实例吗？
									Python
							 									Sublimetext3
							 
Python 如何在数组类型中初始化变量，以及如何为其赋值？
									Python
							 									Image Processing
							 
将类似目录树的字符串转换为嵌套列表数据结构python
									Python
							 									Algorithm
							 									Sorting
							 									Data Structures
							 
在Python中管理（即正确终止）MongoDB守护进程的最可接受的方法是什么？
									Python
							 									Multithreading
							 									Mongodb
							 
Python请求JSON API POST语句中的JSON数据格式不正确
									Python
							 									Json
							 									Api
							 
Python 在列表中搜索字母
									Python
							 									List
							 									Search
							 
Python 如何根据我从tensorflow中的另一个矩阵获得的最大值、次值和索引来获取矩阵中每一行的值？
									Python
							 									Tensorflow
							 
Python 根据另一列中的元素位置提取某些元素
									Python
							 									Pandas
							 									List
							 
Python 更改具有重复列标题的dataframe列中的数据类型
									Python
							 									Pandas
							 
Python 侦听PyQTBound信号上的新信号连接
									Python
							 
Python 使用不同的定位器访问单个值
									Python
							 									Pandas
							 									Indexing
							 
Python 如何在脚本运行时向多处理队列添加更多项
									Python
							 									Python 3.x
							 
Python 它是如何在Keras中的Conv1d中使用输入_形状变量的？
									Python
							 									Tensorflow
							 									Keras
							 
Python 从“复制html代码”；“字符串”；直到下一次”；“字符串”；
									Python
							 									Html
							 									Parsing
							 
Python 如何根据图像质量确定使用哪种OCR方法
									Python
							 									Image Processing
							 
Python多个记录器不工作。如何配置具有不同级别的多个记录器？
									Python
							 
Python3顽强地重试（带装饰器）会导致错误声明；“缺少论点”；使用gspread时
									Python
							 									Python 3.x
							 									Google Api
							 
Python 为什么我可以在不实例化实例的情况下使用random.radint方法？
									Python
							 									Python 3.x
							 									Random
							 
Python Dataframe无法正确设置matplotlib使用的索引
									Python
							 									Pandas
							 									Matplotlib
							 
使用dictionary-Python 3.6将重复行映射为原始行
									Python
							 									Pandas
							 									Dataframe
							 
Python 导入错误：无法导入名称'；时钟'；从'；时间'；（未知位置）
									Python
							 
Python 测试包含assert语句且不包含'；我什么也不退
									Python
							 									Unit Testing
							 
在python中基于字典键值更新字符串，其中键是部分字符串
									Python
							 									Sql
							 									String
							 									Dictionary
							 
Python 如何查找列表中第一位和最后一位的总和
									Python
							 									Python 3.x
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Plot
Java
Nsis
Razor
Libgdx
Apache Spark
Prometheus
.htaccess
Synchronization
Woocommerce
Coffeescript
Xcode
Stripe Payments
Next.js
Floating Point
Actionscript
F#
Google Maps Api 3
Cloud
Xmpp
Sharepoint 2013
Symfony1
Silverlight
Log4net
Mvvm
Apache Zookeeper
Orientdb
Knockout.js
X86
Windows Store Apps
Programming Languages
Google Analytics
Windows Phone 8.1
Safari
Windows
Graphql
Gdb
Gruntjs
Django Rest Framework
Dependencies
Localization
Jboss
Jquery Mobile
Gps
Teamcity
Solr
Ocaml
Svn
Salesforce
Mod Rewrite
Talend
Types
Kdb
Ubuntu
Centos
View
Sockets
Sap
Google Sheets
Install4j
Oracle10g
Oracle Apex
C# 3.0
Airflow
Cookies
Macos
Cassandra
Workflow
Kendo Ui
Java 8
Django Models
Xcode4
Webrtc
Mqtt
Wordpress
Omnet++
Yii2
Authentication
Serial Port
Numpy
Linq To Sql
Coq
Latex
Google Chrome Devtools
System Verilog
Mobile
Nunit
Cmd
Svg
Azure Active Directory
Routing
Jquery
Single Sign On
Layout
Ssrs 2008
Doxygen
Pointers
Iis 7
File
Visual Studio 2010
Google Compute Engine
Weblogic
Gatsby
Drupal 7
Scrapy
Amazon Ec2
Recursion
Unity3d
Internet Explorer
Amazon S3
Netbeans
Azure Cosmosdb
Uiview
Sprite Kit
Oauth 2.0
Linux Kernel
Nestjs
Ethereum
Memory Management
Mono
Url Rewriting
Symfony
Select
Sql Server
Mediawiki
Ckeditor
Opencv
Ravendb
Sqlalchemy
Microsoft Graph Api
Compiler Construction
Angular Material
Server
Drools
Cmake
Gmail
Collections
Redis
Phantomjs
Cypress
Itext
Leaflet
Asp.net Core
Air
Configuration
Mapreduce
Asp.net Core Mvc
Build
Ldap
Chart.js
Blazor
Com
Osgi
Openlayers
Ssis
Groovy
Blockchain
Cakephp
Visual Studio 2015
Automated Tests
Asp Classic
Sparql
Drupal 6
Ip
Exchange Server
Backbone.js
Actionscript 3
Azure Sql Database
Webpack
Session
Kubernetes
Redirect
Objective C
Internationalization
Timer
Nuget
Plugins
Ada
Audio
Continuous Integration
Jsp
Content Management System
Character Encoding
Object
Go
Django
Compiler Errors
Maven
Apache Pig
Phpstorm
Enums
Yii
Migration
Jasmine
Erlang
Android Fragments
Wso2
Apache2
Flask
Laravel 4
Entity Framework Core


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网