使用beautifulsoup和python删除某些标记_Python_Html_Beautifulsoup_Strip - Fatal编程技术网

使用beautifulsoup和python删除某些标记

python html

使用beautifulsoup和python删除某些标记,python,html,beautifulsoup,strip,Python,Html,Beautifulsoup,Strip,问题我正在尝试从我的html文件中删除样式标记，如和，该文件由BeautifulSoup下载。我确实希望保留标记包含的内容（如文本）然而，这似乎不起作用我尝试过的 for url in urls: response = requests.get(url, headers=headers) soup = BeautifulSoup(response.content, 'html.parser') table = soup.find("div", {"class": "

问题

我正在尝试从我的html文件中删除样式标记，如

和

，该文件由BeautifulSoup下载。我确实希望保留标记包含的内容（如文本）然而，这似乎不起作用

我尝试过的

for url in urls:
    response = requests.get(url, headers=headers)
    soup = BeautifulSoup(response.content, 'html.parser')
    table = soup.find("div", {"class": "product_specifications bottom_l js_readmore_content"})
    print "<hr style='border-width:5px;'>"
    for style in table.find_all('style'):
        if 'style' in style.attrs:
            del style.attrs['style']
    print table

对于url中的url：
response=requests.get（url，headers=headers）
soup=BeautifulSoup（response.content'html.parser'）
table=soup.find（“div”，“class”：“产品规格底部”\u l js\u readmore\u内容”}）
打印“


我尝试使用的URL


您可以使用分解（）：

如果只想清除文本或从树中删除元素，请使用clear
和extract
（上面的描述就是分解）。
您正在查找unwrap（）
您的\u soup.tag.unwrap（）
您尚未解释当前解决方案的问题所在。该解决方案从原始页面开始仍为样式。我仍需要保留其内容。我只想删除标签本身。因为它是我的文件的样式，我不想这样做，所以请在问题中解释它，而不是我的评论




[html]相关文章推荐



                                                        
Html 隐藏部分div-单击时切换为打开
html 
如何使背景图像进入；背景“；使用CSS/HTML
htmlcss 
For循环在html输出中显示其他表行
htmlfor-loop 
Html 显示所有文本文档
htmlcss 
Html 我的元素没有填满页面的整个空间
htmlcss 
Html 类型错误：无法读取值'；名称'；未定义的
htmlangularjsfirebase 
Html 使用meanjs从对象值数组中获取下拉列表？
htmlangularjs 
Html 如何在记事本++；？
htmlnotepad++ 
Html 我的例子主体上的空白来自哪里？
htmlcssreactjs 
Html 将多个直接图像链接插入到<；img src=”等&引用&燃气轮机；代码
htmlimagehyperlink 
Html 为什么我的引导列是堆叠的而不是水平的
htmlcsstwitter-bootstrap 
Html 正在从queryset获取ITEM.Category
htmldjango 
Html 为什么'；t高度：100%工作以将div扩展到屏幕高度？
htmlcss 
Html “奇怪”；（双引号）出现在数字和%（百分比）符号之间的空白处
htmlcss 
Html 如何使用Jquery动态创建div/card
htmljquerycss 
Html 是否可以将CSS@media规则内联？
htmlcss 
Html 为什么我的底层云不像顶层云那样采用css样式？
htmlcss 
如何在我的html页面中添加Matplotlib live graph？
htmlpython-3.x 
Html 如何消除导航栏中的空白？
htmlcss 
Html 在没有灰色边框和缩放工具的情况下从google drive嵌入图像？
htmlgoogle-apps-scriptiframegoogle-drive-api 
                                       





随机文章推荐



                                                        
Cucumber 我如何得到黄瓜&；capybara是否使用http://路径而不是file:///路径？
cucumberruby-on-rails-3.1 
Cucumber Capybara-根据属性查找元素'；内容
cucumberautomated-tests 
Cucumber 将值列表传递给场景大纲
cucumber 
Cucumber 如何通过声明性验收测试捕获需求？
cucumber 
Cucumber：使用Poltergeist（PhantomJS）禁用或删除本地存储
cucumberphantomjs 
Cucumber 外部数据源与特征文件的集成
cucumber 
Cucumber 是否可以在SitePrism字段中找到不区分大小写的元素？
cucumber 
Cucumber 水豚如何测试下拉列表中的禁用选项
cucumber 
Cucumber 使用'的正确方法是什么*'；特征文件中的关键字
cucumber 
cucumber hook scenario.embed始终在项目根目录下创建屏幕截图
cucumber 
Cucumber io.CUMBURE.testng中的备选项
cucumbertestng


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
Python中的MATLAB interp2函数
									Python
							 									Matlab
							 
&引用；“穿线”；在Python中，绘制接收到的数据并同时发送
									Python
							 									Multithreading
							 									Wxpython
							 
Python 如何提高NLTK的性能？选择？
									Python
							 
Python Pygame菜单：调用另一个.py文件
									Python
							 									Import
							 
nohup上的python子进程输出
									Python
							 
Python3-标识符中的无效字符（在公式中）
									Python
							 
Python 如何读取文件并将其存储在字典中。不用拉链
									Python
							 									Dictionary
							 
Python numpy/scipy：使一个系列在一段时间后收敛到另一个系列
									Python
							 									Numpy
							 									Pandas
							 
Python Plotly：如何添加自定义图例
									Python
							 
Python 无法安装该软件包"；电邮；
									Python
							 									Linux
							 									Python 3.x
							 									Ubuntu
							 
安装测试版发行版（python）-请澄清
									Python
							 
Python 如何索引astropy表中的可观测坐标
									Python
							 
Python 为什么scipy.optimize.minimize（默认）报告成功而不使用Skyfield移动？
									Python
							 
Python 如何创建一段代码来检查一个数字的最大素数因子？
									Python
							 									Algorithm
							 									Python 3.x
							 
在Python中使用sys.stdin.readline（）从cmd读取多行
									Python
							 
Python 斯卡皮在窗户上嗅东西
									Python
							 
从CSV文件中读取随机行只读该行并移动到Python中的另一个CSV
									Python
							 									Csv
							 									Random
							 
用python从html中提取excel文件
									Python
							 									Vba
							 									Python 3.x
							 									Excel
							 
PythonPyGame错误；精灵实例没有“调用”方法；
									Python
							 
Python 检测到人时显示图像
									Python
							 									Opencv
							 
Python ssl套接字类型错误
									Python
							 									Python 2.7
							 									Sockets
							 
python脚本中的shell脚本，参数为sql脚本
									Python
							 									Sql
							 									Shell
							 
Python 在循环中仅打印一个箭头
									Python
							 									Matplotlib
							 
Python 从行中提取特定列并合并列
									Python
							 									Pandas
							 
Python discord.ext.commands.errors.CommandInvokeError:命令引发异常：AttributeError:'；命令'；对象没有属性'；服务器'；
									Python
							 									Discord.py
							 
Python panda数据帧中多个simliar实体的处理
									Python
							 									Pandas
							 									Dataframe
							 
Python正则表达式
									Python
							 									Regex
							 
Python 创建/调用类实例，未定义错误对象
									Python
							 									Class
							 
Python 无法为此格式设置strtime格式2018-07-26 12:52:18.679605-07:53
									Python
							 									Date
							 
从Python中3d数组的每个2d元素中减去2d数组将Matlab代码转换为Python
									Python
							 									Loops
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Javafx
Vbscript
Debian
Linq To Sql
Youtube Api
Dotnetnuke
Swift
Formatting
Umbraco
Elm
Isabelle
Alfresco
Ecmascript 6
Dataframe
Scikit Learn
Unix
Stored Procedures
Tkinter
C# 4.0
Programming Languages
Netbeans
Servlets
Sockets
Reference
Yaml
Apache Zookeeper
Struts2
Actionscript
Json
Apache Flink
Compilation
Chef Infra
Url
Deep Learning
Pytorch
Nosql
C++11
Ip
Math
Azure Sql Database
Angular Material
Google Cloud Storage
Gmail
Woocommerce
Phpstorm
Asp.net Mvc 3
Scroll
Nuget
Robotframework
Openerp
Doctrine
Chart.js
Big O
Ssrs 2008
Process
Jboss
Ajax
Couchbase
Uiview
Ionic2
Jekyll
Tfs
Vagrant
Phpunit
Python Sphinx
Gruntjs
Sencha Touch 2
Ruby
Ipad
Numpy
Itext
Pentaho
Reporting Services
List
Networking
Rest
Protocol Buffers
Nsis
Hibernate
Tabs
Drupal 6
Jakarta Ee
Discord
Electron
Notifications
Git
Embedded
Opengl
Github
Grid
3d
Perforce
Antlr
Web Crawler
Jenkins
.htaccess
.net 4.0
Linker
Graphql
File Io
Select
Cmd
Opencl
Nest
Hyperledger Fabric
Csv
Combobox
Jquery Mobile
Permissions
Blazor
Dll
Solr
Excel Formula
Webstorm
Phantomjs
Time
Templates
Functional Programming
Random
Asp.net Core
Anaconda
Entity Framework Core
Batch File
Common Lisp
Ada
Angularjs
Doxygen
Kibana
Stream
Google Maps
Php
Service
Parsing
Soap
Encoding
Ionic Framework
Data Structures
Visual Studio 2010
Wso2
Azure
Scripting
Llvm
Windows 10
React Native
Hadoop
Cloud
Openstack
Snmp
Omnet++
Smalltalk
Keras
Streaming
Oracle11g
Tableau Api
Tcl
Jquery Ui
Amazon Cloudformation
Cobol
Primefaces
Apache Camel
Salesforce
Colors
Configuration
Flask
Rspec
Google Analytics
Spring Cloud
Requirejs
Autohotkey
Zend Framework2
Gridview
Mercurial
Iframe
Signalr
Terraform
Kernel
Reflection
Spring
Udp
Oracle Apex
Swing
Ruby On Rails
Webpack
Frameworks
Arm
Qt4
Geolocation
Awk
Neural Network
Cygwin
Asp Classic
Visual C++
Osgi
Jupyter Notebook
Command Line
Tags
.net
Windows Phone 7
Discord.js
Cakephp
Rally


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网