Python 如何删除特定类的标记？_Python_Python 3.x_Beautifulsoup - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/python-3.x/16.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 如何删除特定类的标记？_Python_Python 3.x_Beautifulsoup - Fatal编程技术网

Python 如何删除特定类的标记？

python python-3.x

Python 如何删除特定类的标记？,python,python-3.x,beautifulsoup,Python,Python 3.x,Beautifulsoup,我正在使用Beautifulsoup（python3.x）解析HTML页面我正在尝试从我编写的标记中获取数据 def getBody(url): html_page = requests.get(url) soup = BeautifulSoup(html_page.content, 'html.parser') Con = "".join([p.text for p in soup.find_all("p")]) #print(Con) return Con

我正在使用Beautifulsoup（python3.x）解析HTML页面我正在尝试从我编写的标记中获取数据

def getBody(url):
    html_page = requests.get(url)
    soup = BeautifulSoup(html_page.content, 'html.parser')
    Con = "".join([p.text for p in soup.find_all("p")])
    #print(Con)
return Con

但在这样做的过程中，我从下面的htmltag获得了文本。我怎样才能删除这个

本文的评论已关闭。
您可以使用或删除标记
>>> from bs4 import BeautifulSoup
>>> html = '''
... <p>text</p>
... <p class="notice">Comments are closed for this article.</p>
... <p>text</p>
... <p class="notice">Comments are closed for this article.</p>
... <p>text</p>'''
>>> soup = BeautifulSoup(html, 'html.parser')
>>> for tag in soup.find_all('p', class_='notice'):
...     tag.decompose()
...
>>> soup

<p>text</p>

<p>text</p>

<p>text</p>

>>来自bs4导入组
>>>html=“”
...  正文
... 此文章的评论已关闭
...  正文
... 此文章的评论已关闭
...  文本“”
>>>soup=BeautifulSoup（html，'html.parser'）
>>>用于汤中的标记。查找所有（'p'，class='notice'）：
...     tag.decompose（）
...
>>>汤
正文
正文
正文




[python 3.x]相关文章推荐



                                                        
                                       





随机文章推荐



                                                        
需要找出freebase quad rdfize的源代码
rdf 
Rdf 在构建语义web应用程序时，OWL实际上是如何使用的？
rdf 
Rdf 通过正则表达式查询主语或谓词
rdfsparql 
Rdf 海龟中的GML字符串
rdf 
在RDF URI中不使用驼峰大小写？
rdf 
Rdf Jena-从任何语言的本体类中获取标签
rdf 
Rdf 合并两个Jena模型
rdf 
Rdf Jena为本体类显示了错误的名称空间
rdf 
Rdf 如何通过Sparql查询在Jena中进行推理
rdfsparql 
Rdf 在OWL中定义数据属性的基数
rdf 
Rdf 是否可以检索三元组'；DBPedia中的源数据集是什么？
rdfsparql 
Rdf 杰纳暴乱中的JSON-LD？
rdf 
Rdf Sparql查询中的多重计数
rdfsparql 
Rdf 检查值是否对SPARQL中的自定义数据类型有效
rdfsparql 
控制RDF到“的转换；“更漂亮”；JSON-LD
rdf 
Rdf 我如何翻译像“这样的主题？”；www.freebase.com/m/0cz9079“；变成人类可以阅读的文字？
rdf 
RDF语句可以有多个主题吗？
rdf


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
如何探索python中的包
									Python
							 
python的trackback错误-使用pymssql
									Python
							 
Python HTML表格--在标题中放置链接以进行排序？（无JavaScript）
									Python
							 									Html
							 									Django
							 									Hyperlink
							 
Python：获取主卷windows 7
									Python
							 									Windows
							 									Audio
							 
Python 子进程popen stdout锁定？
									Python
							 									Python 2.7
							 
Python 获取边g.edge_iter末端的节点
									Python
							 
Python 让PyCharm认出水蟒的SciPy
									Python
							 									Pycharm
							 									Anaconda
							 
Python 如何从文件中的每一行识别多个单词和相应的值，例如：“status”：“ok”
									Python
							 									Json
							 									String
							 									List
							 
Python MySQLdb存储过程输出参数不工作
									Python
							 									Mysql
							 
Python 红移复制操作在SQLAlchemy中不起作用
									Python
							 									Sqlalchemy
							 									Amazon Redshift
							 
在Python中，在函数上使用协程有哪些方面？
									Python
							 
在Python中添加内容处置标头-不发送电子邮件
									Python
							 									Email
							 
Python 我的rk4实现的加权平均中的不同近似值都是相同的
									Python
							 
Python BST中的第k个最小元素
									Python
							 
Python无法安装PyGObject
									Python
							 									Windows
							 									Python 3.x
							 									Pip
							 
Python 基于某些模式和区块内的信息拆分文件
									Python
							 									Bash
							 									Perl
							 
Python 如何加载动态内容
									Python
							 									Flask
							 
Python 使用selenium execute使用javascript刮取站点
									Python
							 
Python PostgreSQL-查询所有表的所有表列
									Python
							 									Sql
							 									Arrays
							 									Database
							 									Postgresql
							 
Python odoo更改图形的默认类型
									Python
							 									Odoo
							 
Python 从列表的每个元素中获取子字符串
									Python
							 									Html
							 									Python 3.x
							 									Web Scraping
							 
Selenium Python编码选择下拉列表：获取错误SeleAttributeError:“list”对象没有属性“tag\u name”
									Python
							 									Selenium
							 									Select
							 
为什么VisualStudio代码中的Python repl告诉我我的对象未定义？
									Python
							 									Visual Studio Code
							 
Python 动态刮取JSON值
									Python
							 									Json
							 									Web Scraping
							 									Scrapy
							 
Python 使用数据和JSON参数发送post请求
									Python
							 
Python/SQL-基于现有已填充记录的回滚/转发来填充记录的循环
									Python
							 									Sql
							 									Apache Spark
							 									Pyspark
							 
Python Tkinter使用鼠标在图像上选择一个区域并存储选择的坐标
									Python
							 									Tkinter
							 
为什么我的Python程序将数字1识别为素数？
									Python
							 
Python 在另一列中筛选同时具有null和not null值的值
									Python
							 									Pandas
							 
Python PySpark使用两种不同的文件类型从s3中的zip文件读取csv
									Python
							 									Apache Spark
							 									Amazon S3
							 									Pyspark
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
C# 4.0
Servlets
Windows Installer
Tinymce
Ubuntu
Linker
Cors
Dom
Antlr4
Ecmascript 6
Knockout.js
Socket.io
Cypress
Swift3
Smtp
Asp.net Mvc
Activemq
Sencha Touch 2
Replace
Linux
Process
Sip
Jira
Floating Point
Netlogo
Kubernetes
Node.js
Xml
Join
Authentication
Cloud
Material Ui
Pip
Opengl
Spring Cloud
Wpf
Prometheus
Iphone
Air
Gremlin
Three.js
Io
Swift2
Rabbitmq
Scikit Learn
Glassfish
Office365
Random
Smalltalk
Class
Html
Spring Batch
Plsql
Actionscript 3
Calendar
Kernel
Amp Html
Jmeter
Selenium Webdriver
Coffeescript
Quickbooks
File Io
.net 4.0
Validation
Concurrency
Optimization
Javafx
Core Data
Ip
Virtual Machine
Forms
Cassandra
Graphql
Recursion
Filter
Azure Data Factory
Ibm Mq
Pagination
Uitableview
Mapbox
Log4net
Verilog
Clojure
Tsql
Appium
Intellij Idea
Sms
Gwt
Mongodb
Syntax
Bazel
Interface
Akka
Visual C++
Indexing
Logstash
Eclipse Plugin
Pytorch
Jar
Haskell
Merge
Acumatica
Amazon Redshift
Go
Programming Languages
Microservices
Raspberry Pi
Grid
Eclipse
Service
Meteor
Cocoa Touch
Windows Store Apps
Discord.js
Pandas
Continuous Integration
Fiware
Nestjs
Seo
Configuration
Windows Runtime
Groovy
Vue.js
Internet Explorer
Applescript
Silverlight
Serialization
Umbraco
Google Bigquery
Sitecore
Office Js
Orientdb
Winapi
For Loop
Input
Openerp
Unix
Ipython
Nhibernate
Gmail
Directx
Airflow
Svn
Google Cloud Dataflow
Url
Ssh
Report
Rx Java
Version Control
C#
Blockchain
Hive
Ibm Midrange
Memory
Laravel 5
Erlang
Time
Terminal
Razor
Regex
Character Encoding
Keras
Variables
Jpa
Python 2.7
Axapta
Imagemagick
Data Binding
Computer Vision
Typescript
Prestashop
View
Java Me
Tomcat
Cmake
Xamarin.ios
Design Patterns
Jwt
Openlayers 3
Rss
Twitter
Iis
Mapping
Llvm
Loops
Url Rewriting
Sas
Xamarin.forms
Cron
Xpath
Tridion
Angular
Dns
Fullcalendar
Asp.net Mvc 5
Maven 2
Dependency Injection
Azure Sql Database
Events
Enums
Vagrant


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网