How to check whether data is missing when scraping a web page with Python, then move on to the next row

python html json web-scraping

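Hi, I am trying to get data by scraping a web page, but on some items the tag I request is not available. When that data is missing I need to skip the item; otherwise, how can I get the data from its tag?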

    import requests
    from bs4 import BeautifulSoup
    from datetime import datetime
    
    header = {'User-Agent': 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.7) Gecko/2009021910 Firefox/3.0.7'}
    
    base_url = "https://www.avva.com.tr/outlet"
    main_url = "https://www.avva.com.tr"
    r = requests.get(base_url, headers=header)
    
    if r.status_code == 200:
        soup = BeautifulSoup(r.text, 'html.parser')
        books = soup.find_all('div', attrs={"class": "ItemOrj col-3"})
        my_date = datetime.now()
        result = []
        for book in books:
            title = book.find('a')['title']
            link = main_url+book.find('a')['href']
    
            picture = book.find('img')['src']
            print(picture)
    
    
    else:
        print(r.status_code)

Try selecting the img element first and test that it is not None. If it is not, then read its src attribute:

    picture = book.find('img')
    if picture is not None:
        picture_src = picture['src']
        print(picture_src)
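
The None check can also be wrapped in a small helper so it does not have to be repeated for every field. The following is only a sketch: the helper name safe_attr and the inline SAMPLE_HTML are hypothetical and stand in for the real page, so the example runs without a network request.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup: the second item has no <img> tag.
SAMPLE_HTML = """
<div class="ItemOrj col-3"><a title="Shirt" href="/shirt"><img src="/img/shirt.jpg"></a></div>
<div class="ItemOrj col-3"><a title="Jacket" href="/jacket"></a></div>
"""


def safe_attr(parent, tag_name, attr, default=None):
    """Return the attribute of the first matching child tag, or default if
    either the tag or the attribute is missing."""
    tag = parent.find(tag_name)  # find() returns None when nothing matches
    if tag is None:
        return default
    return tag.get(attr, default)  # Tag.get() avoids a KeyError


soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
items = soup.find_all("div", class_="ItemOrj")
pictures = [safe_attr(item, "img", "src") for item in items]
print(pictures)
```

With this approach a missing picture shows up as None in the output instead of raising a TypeError, which may be preferable when the other fields of the item are still worth keeping.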

Put the following line before the line picture = book.find('img')['src']:

    if not book.find('img'): continue
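
As a rough illustration of the continue approach applied to the whole loop, the sketch below skips any item that lacks a required tag and collects the rest. The function name parse_items and the inline SAMPLE_HTML are assumptions made for the example; only the class name and main URL come from the question.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

MAIN_URL = "https://www.avva.com.tr"

# Hypothetical sample markup: only the first item has both <a> and <img>.
SAMPLE_HTML = """
<div class="ItemOrj col-3"><a title="Shirt" href="/shirt"><img src="/img/shirt.jpg"></a></div>
<div class="ItemOrj col-3"><a title="Jacket" href="/jacket"></a></div>
<div class="ItemOrj col-3"><img src="/img/orphan.jpg"></div>
"""


def parse_items(html, base_url=MAIN_URL):
    """Return one dict per item, skipping items with a missing tag."""
    soup = BeautifulSoup(html, "html.parser")
    result = []
    for item in soup.find_all("div", class_="ItemOrj"):
        link_tag = item.find("a")
        img_tag = item.find("img")
        # Move on to the next item when a required tag is absent.
        if link_tag is None or img_tag is None:
            continue
        result.append({
            "title": link_tag.get("title"),
            "link": urljoin(base_url, link_tag.get("href", "")),
            "picture": img_tag.get("src"),
        })
    return result


rows = parse_items(SAMPLE_HTML)
print(rows)
```

Comparing against None explicitly makes the intent clear; each tag is also looked up only once instead of calling find() again after the check.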




Copyright © 2024. All Rights Reserved by Fatal编程技术网