Python Scrapy只返回第一个结果_Python_Scrapy - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/355.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python Scrapy只返回第一个结果_Python_Scrapy - Fatal编程技术网

Python Scrapy只返回第一个结果

python scrapy

Python Scrapy只返回第一个结果,python,scrapy,Python,Scrapy,我试图从gelbeseiten.de（德国的黄页）中获取数据因此，我得到了15套始终相同的地址，但我认为应该是15个不同的地址非常感谢您的帮助。您使用的是绝对xpath表达式： address.xpath（'//span[@itemprop=“streetAddress”]///text（））而应使用相对地址（注意表达式中的前导点）： address.xpath（'.//span[@itemprop=“streetAddress”]//text（）） # -*- coding: utf-8

我试图从gelbeseiten.de（德国的黄页）中获取数据

因此，我得到了15套始终相同的地址，但我认为应该是15个不同的地址

非常感谢您的帮助。

您使用的是绝对xpath表达式：

address.xpath（'//span[@itemprop=“streetAddress”]///text（））

而应使用相对

地址（注意表达式中的前导点）：
address.xpath（'.//span[@itemprop=“streetAddress”]//text（））

# -*- coding: utf-8 -*-
import scrapy
  from scrapy.spiders import CrawlSpider
  from scrapy.http import Request
  from scrapy.selector import Selector
  from scrapy.http import HtmlResponse


class GelbeseitenSpider(scrapy.Spider):
  name = "gelbeseiten"
  allowed_domains = ["http://www.gelbeseiten.de"]
  start_urls = ['http://www.gelbeseiten.de/zoohandlungen/s1/alphabetisch']

  def parse(self, response):
    for adress in response.css('article'):
      #Strasse
      strasse = adress.xpath('//span[@itemprop="streetAddress"]//text()').extract_first()

      #Name
      name = adress.xpath('//span[@itemprop="name"]//text()').extract_first()

      #PLZ
      plz = adress.xpath('//span[@itemprop="postalCode"]//text()').extract_first()

      #Stadt
      stadt = adress.xpath('//span[@itemprop="addressLocality"]//text()').extract_first()

      yield {
        'name': name,
        'strasse': strasse,
        'plz': plz,
        'stadt': stadt,
      }




[scrapy]相关文章推荐



                                                        
scrapyd如何确定'；最新版本'；项目的版本？
scrapy 
scrapyd部署按scrapyd客户端显示0个spider
scrapy 
当Scrapy spider的实例正在爬行时，我们如何动态更改下载延迟设置？
scrapy 
Scrapy保持返回空值
scrapy 
Scrapy 刮擦/飞溅选择选项
scrapy 
Scrapy 如何通过刮取网页来确定网站的名称？
scrapyweb-crawler 
Scrapy |如果没有urllib，如何从请求中获得响应？
scrapy 
Scrapy 刮痕图像管道：如何在校验和上删除图像？
scrapy 
是否有一种方法可以从scrapy中的数据库中获取起始URL的ID？使用一些函数，从URL发出请求
scrapy 
Scrapy：使用Splash刮取JS呈现的页面
scrapy 
Scrapy 无限爬虫的请求回调问题
scrapy 
Scrapy 如何用一个命令取消所有作业？
scrapy 
难以将requests.models.Response转换为scrapy.selector.unified.selector
scrapy 
Scrapy没有得到所有的回答
scrapy 
                                       





随机文章推荐



                                                        
Php 保留html标记，就像在tinymce编辑器中一样
phptinymce 
PHP GD-Mac与Linux的不同文本偏移量
phplinuxmacos 
Php 懒惰加载，我这样做对吗？
phpapi 
如何在PHP中使用任何一个索引对整个多维数组进行排序？
php 
Php 对非对象调用成员函数get（）
phpcodeigniter 
循环中使用empty（）的PHP帮助
phploops 
通过phonegap调用Web服务器中的PHP文件
phpcordova 
Ajax使用GET将变量传递给PHP
phpajax 
如何在PHP类中调用回调函数
php 
Php 使用MYSQL在一列中搜索两个值
phpmysql 
Php正则表达式获取用户名和id
phpregex 
Php 尝试使用preg_match_all将4个或更多字符的单词与3个或更少字符的单词分组
phpregex 
Php 基于最长匹配组织搜索结果
phpmysql 
Php mysql查询以获取每个公司的所有评论
phpmysqldatabase 
Php strtotime（）要求参数1为字符串，数组在codeigniter中给定
phpcodeigniter 
“如何重复php”；如果；20次？
phpfor-loop 
Php 如何计算每个给定评级的平均评级？
phpmath 
Php 如何解决“问题”；从“中的空值”创建默认对象；错误？
php 
将Curl转换为不发送数据的Php
phpcurl 
Php Foreach循环仅显示api中数组中的最后一项
phparraysjsonloops


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Css
Rdf
Animation
Teradata
Path
Regex
Hadoop
Architecture
Fluent Nhibernate
Graphql
Stored Procedures
Silverlight
Subsonic
Vb6
Canvas
Join
Stanford Nlp
Heroku
Dependencies
Zsh
Itext
Inheritance
Netbeans
Dataframe
Keycloak
C# 3.0
Excel
Ruby On Rails
Yii
Wpf
Moodle
Oauth 2.0
Jira
Couchdb
Ios6
Sql Server 2008 R2
EmptyTag
Google Maps Api 3
Ocaml
C
Winapi
Jasmine
Cobol
Android Fragments
Python
Air
Vector
Orm
Generics
Openstack
Hive
Excel Formula
Github
Ember.js
Requirejs
Amazon Ec2
Gnuplot
Junit
Dynamics Crm 2011
Asterisk
Jqgrid
Ant
Opengl Es
Swing
Sprite Kit
Map
Routes
Ag Grid
Colors
Apache Storm
Maven
Spring Batch
Nosql
Docker Compose
Quickbooks
Data Binding
Three.js
Snowflake Cloud Data Platform
Internationalization
C#
Nservicebus
Server
Inno Setup
Razor
.net Core
Vba
Amazon Dynamodb
Dom
Fullcalendar
Linker
Smalltalk
Airflow
Pycharm
Azure Sql Database
Calendar
Phpmyadmin
Umbraco
Microservices
Localization
For Loop
Ssis
Ipython
Sails.js
Gulp
Postman
Websphere
Stm32
Codeigniter
Magento2
Macos
Primefaces
Grails
Omnet++
Swagger
Ios8
Powershell
Asp Classic
Sparql
Cygwin
Html5 Canvas
Tinymce
Ruby On Rails 3
Chart.js
Image Processing
Xaml
Sqlalchemy
Amazon Redshift
Vmware
Clearcase
E Commerce
Cassandra
Lua
Cron
Google Analytics
Grid
Axapta
Sublimetext3
Netty
Ruby On Rails 3.2
Streaming
Openlayers 3
Gridview
Angularjs
Video Streaming
Jaxb
Gmail
Shiny
Redirect
Pagination
Tensorflow
Syntax
Gremlin
Ubuntu
Sonarqube
Leaflet
Csv
Computer Vision
Openerp
Plugins
Debugging
Angular
Udp
Awk
Input
Parameters
Python 3.x
Ibm Cloud
Android Ndk
Command Line
Appium
Installation
Next.js
Chef Infra
Phantomjs
Smtp
Twilio
Jms
Mono
Spring Integration
Xamarin.android
Sharepoint 2007
Sap
Libgdx
Database Design
Video
Biztalk
Jupyter Notebook
Openssl
Cypress
String
Dynamic
Rust
Java
Exception Handling
Jsf
Acumatica
Android
Javafx
Twitter
Oracle
Knockout.js


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网