Python: Appending product names to a single list in Scrapy


python list scrapy


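My current Scrapy spider extracts product titles from the site as intended, but it puts the results for each start URL into its own separate ['product'] list. I would like the results from all start_urls to go into a single list (one per field: product, price, and so on), so that I can refer to each product title in that list during later processing.

Here is my current spider: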

from scrapy.spider import BaseSpider
from scrapy.selector import HtmlXPathSelector
from proj.items import projItem


class siteSpider(BaseSpider):
    name = "newSpider"
    # allowed_domains takes bare domain names, not full URLs
    allowed_domains = ["www.sample.url"]
    start_urls = [
        "http://sample1.url",
        "http://sample2.url",
    ]

    def parse(self, response):
        hxs = HtmlXPathSelector(response)
        # A new list is created on every call, i.e. once per start URL
        items = []
        item = projItem()
        item["product"] = hxs.select('//h2/a[contains(@class,"next_prev")]/text()').extract()
        items.append(item)
        return items

Where do you want to do the processing? If you want a general-purpose list that is available throughout the spider, you can use:

class siteSpider(BaseSpider):
    ...
    # Class-level dict shared by every parse() call
    generic_dict = {'product': [], 'price': [], 'etc': []}
    ...

    def parse(self, response):
        ...
        # extend() with the extracted strings keeps one flat list
        self.generic_dict['product'].extend(hxs.select(...).extract())
        ...

    def manipulations(self):
        # ... manipulations here ...
        return self.generic_dict

The reason you get these separate lists is that Scrapy makes a new call to the parse function for each URL in start_urls, so your items list is re-initialized on every call.
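The difference between the two approaches can be seen without Scrapy at all. The following is a minimal sketch, not the spider itself: `FakeSpider`, `parse_fresh`, `parse_shared`, and the sample titles are hypothetical stand-ins, and it assumes the extracted titles arrive as plain lists of strings, one list per start URL.

```python
# Stand-in for a Scrapy spider: no Scrapy import, so this runs anywhere.
# FakeSpider, parse_fresh, parse_shared and the sample titles are
# hypothetical names used only for this illustration.

class FakeSpider:
    # Shared across every parse call, like generic_dict in the answer.
    generic_dict = {"product": [], "price": []}

    def parse_fresh(self, titles):
        """Mimics the question's parse(): a new list on every call."""
        items = []
        items.append({"product": list(titles)})
        return items

    def parse_shared(self, titles):
        """Mimics the answer's parse(): accumulate on the spider itself."""
        # extend() keeps the list flat; append() would nest one list per page.
        self.generic_dict["product"].extend(titles)


# One entry per start URL: the titles "extracted" from that page.
pages = [["Widget A", "Widget B"], ["Widget C"]]

spider = FakeSpider()
fresh_results = [spider.parse_fresh(titles) for titles in pages]
for titles in pages:
    spider.parse_shared(titles)

print(fresh_results)                    # one separate list per page
print(spider.generic_dict["product"])   # one combined list
```

Note that a dict defined at class level is shared by all instances of the class. That is usually fine for a single running spider, but creating the dict in `__init__` would be the safer choice if several instances may exist.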




Copyright © 2024. All Rights Reserved by Fatal编程技术网