Python BeautifulSoup: tag missing under a tag (Python, Tags, BeautifulSoup) - Fatal编程技术网


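I want to get the text from the "h1" tags. I'm using BeautifulSoup, and it works fine until an "article" tag has no "h1" tag inside it; then I get the error "'NoneType' object has no attribute 'contents'". The code is as follows: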

from bs4 import BeautifulSoup

# Sample page: the first article has no <h1>.
page = """<article>
<a href="http://something">
</a>   (missing "h1")
<a href="http://something">
</a>
</article>
<article>
<a href="http://something">
</a>
<a href="http://something">
   <h1>something</h1>
</a>
</article>
<article>
<a href="http://something">
</a>
<a href="http://something">
   <h1>something</h1>
</a>
</article>"""

soup = BeautifulSoup(page, "lxml")

h1s = []

articles = soup.find_all("article")

for i in range(1, len(articles)):
    # articles[i].h1 is None when the article has no <h1>,
    # so accessing .contents on it raises AttributeError
    h1s.append(articles[i].h1.contents)


This is what I get when I check the type for an article without an h1 tag and for one with an h1 tag:

type(articles[0].h1) 
<type 'NoneType'>
type(articles[1].h1)
<class 'bs4.element.Tag'>
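
The output above suggests the cause: attribute access such as `article.h1` behaves like `article.find("h1")` and yields `None` when no matching tag exists, so `.contents` fails on it. A minimal sketch of guarding against that case follows. The shortened sample page and the use of the built-in `"html.parser"` (instead of `"lxml"`) are assumptions made for illustration, not part of the original question.

```python
from bs4 import BeautifulSoup

# Shortened sample page (an assumption for illustration): the first
# article has no <h1>, the second one does.
page = """
<article><a href="http://something"></a></article>
<article><a href="http://something"><h1>something</h1></a></article>
"""

# "html.parser" ships with Python, so this sketch needs no extra parser.
soup = BeautifulSoup(page, "html.parser")

h1s = []
for article in soup.find_all("article"):
    h1 = article.h1  # None when the article contains no <h1>
    if h1 is not None:
        h1s.append(h1.get_text())

print(h1s)
```

Checking `is not None` before touching the tag skips articles without a heading instead of raising an error.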


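You can simply loop over articles, which is a list, and use the find_all() method to get every h1 inside each article, then add its text to h1s. An article without an h1 yields an empty result, so nothing is appended for it and no error is raised. This seems to be what you want:

h1s = []
articles = soup.find_all("article")
for article in articles:
    # find_all() returns an empty list when the article has no <h1>
    for h1 in article.find_all('h1'):
        h1s.append(h1.text)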
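
One consequence of the loop above is that articles without an h1 leave no trace in `h1s`, so its indexes no longer line up with `articles`. If that alignment matters, a possible variant keeps one entry per article and stores `None` as a placeholder. This is only a sketch under assumptions: the sample page, the heading texts, the name `per_article`, and the `"html.parser"` choice are all illustrative and not taken from the original answer.

```python
from bs4 import BeautifulSoup

# Illustrative page (an assumption): the first article lacks an <h1>.
page = """
<article><a href="http://something"></a></article>
<article><a href="http://something"><h1>first</h1></a></article>
<article><a href="http://something"><h1>second</h1></a></article>
"""

soup = BeautifulSoup(page, "html.parser")

# One entry per article: the heading text, or None when there is no <h1>.
per_article = []
for article in soup.find_all("article"):
    h1 = article.find("h1")
    per_article.append(h1.get_text() if h1 is not None else None)

print(per_article)
```

With this shape, `per_article[i]` always describes `articles[i]`, at the cost of having to handle the `None` entries later.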



