Python: Parsing tags with BeautifulSoup - Fatal编程技术网


python html parsing tags


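I ran into a Python programming problem involving BeautifulSoup.

First, I need to write a function that extracts all the tags from a web page's source. This is what I did: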

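    from bs4 import BeautifulSoup

    def parseUsingSoup(content):
        # Parse the content that is passed in, rather than an undefined global,
        # and name the parser explicitly.
        soup = BeautifulSoup(content, 'html.parser')
        return soup.find_all('h3')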
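
As a minimal sketch of the idea above, assuming the page source is available as a string: the function below parses its own argument and returns the text of each h3 tag. `SAMPLE_HTML` and `h3_texts` are made-up names for illustration and are not part of the original exercise.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the downloaded page source.
SAMPLE_HTML = (
    "<html><body>"
    "<h3>Events and lectures</h3>"
    "<p>First paragraph</p>"
    "</body></html>"
)

def h3_texts(content):
    """Return the stripped text of every h3 tag found in content."""
    soup = BeautifulSoup(content, "html.parser")
    return [tag.get_text(strip=True) for tag in soup.find_all("h3")]

headings = h3_texts(SAMPLE_HTML)
print(headings)
```

Returning plain strings instead of tag objects is only one option; keeping the tag objects is useful if you still need to navigate from each heading to its neighbours.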

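The website I am trying to parse is the following:

It contains only one h3 tag. The exercise now asks me to extend my function so that it also returns all the content in the p tags associated with that h3. It also asks for a list of events made up of 4-tuples that give each event's date, title, type, and description.

I really don't know how to do this. I have tried several different approaches, but none of them gave me the correct result. Thanks in advance.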

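Here is one way to get all of the tags below:

    from bs4 import BeautifulSoup
    from urllib.request import urlopen  # Python 3 replacement for urllib2

    # Despite its name, content holds the URL of the page to fetch.
    content = 'http://www.auc.nl/news-events/events-and-lectures/events-and-lectures.html?page=1&pageSize=40'

    soup = BeautifulSoup(urlopen(content), 'html.parser')

    # The page has a single h3, so this prints every p tag once.
    for x in soup.find_all('h3'):
        for y in soup.find_all('p'):
            print(y)

You can then parse this output into a list as you see fit.

Thanks! This helps a lot. Is it possible to pull the time of each event (as well as its type, title, and so on) out of the HTML? That way I could create a dictionary for each event and then put them all into a list.

Yes. It looks like you just need to try fetching different tags until you find the right ones. For the second part of your comment, you can save the result to a variable and then parse that variable. For a more detailed answer, please open a new question.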
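
To illustrate the follow-up about tuples and dictionaries, here is a rough sketch rather than a tested solution for the actual page. The thread never shows the real markup, so `SAMPLE_HTML` is invented, and the code assumes each event is four consecutive p tags (date, title, type, description) that are siblings of the h3. The helper names `paragraphs_after` and `events_from_html` and the `FIELDS` tuple are likewise hypothetical.

```python
from bs4 import BeautifulSoup

# Invented markup: the structure of the real page is an assumption.
SAMPLE_HTML = """
<div>
  <h3>Events and lectures</h3>
  <p>1 March</p>
  <p>Open Day</p>
  <p>Information session</p>
  <p>Meet students and staff.</p>
  <p>8 March</p>
  <p>Guest Lecture</p>
  <p>Lecture</p>
  <p>A talk on web scraping.</p>
</div>
"""

FIELDS = ("date", "title", "type", "description")

def paragraphs_after(heading):
    """Collect the text of p siblings that follow heading, up to the next h3."""
    texts = []
    for sibling in heading.find_next_siblings(["h3", "p"]):
        if sibling.name == "h3":
            break  # the next heading starts a new group
        texts.append(sibling.get_text(strip=True))
    return texts

def events_from_html(html):
    """Return a list of (date, title, type, description) tuples."""
    soup = BeautifulSoup(html, "html.parser")
    events = []
    for heading in soup.find_all("h3"):
        texts = paragraphs_after(heading)
        # Take the texts four at a time; an incomplete trailing group is ignored.
        for start in range(0, len(texts) - 3, 4):
            events.append(tuple(texts[start:start + 4]))
    return events

events = events_from_html(SAMPLE_HTML)
# One dictionary per event, as asked about in the comments.
event_dicts = [dict(zip(FIELDS, event)) for event in events]
print(event_dicts)
```

If the real page marks the fields with class names or nests each event in its own container, selecting on those would be more reliable than counting paragraphs.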






                

                        
						
                        
                                
                                        
                                                
                                                        