Extracting a string from an href tag with Python 2.7.x


python regex python-2.7


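I am currently using BeautifulSoup4 to extract "a href" tags from an HTML page. I am using BeautifulSoup4's find_all query, which works fine and returns the "a href" tags I am looking for. Here is an example of what is returned: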

"<a href="manage/foldercontent.html?folder=Pictures" style="background-image: url(shares/Pictures/DefaultPicture.png)" target="content_window" title="Vaya al recurso compartido Pictures">Pictures</a>"

import urllib2
from bs4 import BeautifulSoup

# example_url is the address of the page being scraped
req = urllib2.Request(example_url)
response = urllib2.urlopen(req)
soup = BeautifulSoup(response.read(), from_encoding=response.info().getparam('charset'))
for link in soup.find_all('a', href=True):
    # Keep only the relevant 'a href' tags
    if "foldercontent.html?folder" in link['href']:
        print link

Is this possible by changing what I search for, or do I have to run a regular expression over the returned string?

You can use a URL parser to get only the URL path, only the query string, or to parse the query string into its components.
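As a rough illustration of the URL-parsing suggestion, the sketch below splits the sample href from the question into its path, its query string, and its parsed query parameters. It assumes the standard library parser is what the answer had in mind (`urlparse` in Python 2.7, `urllib.parse` in Python 3), and that the value of the `folder` parameter is the string of interest. The helper name `split_href` is hypothetical.

```python
try:
    from urllib.parse import urlparse, parse_qs  # Python 3
except ImportError:
    from urlparse import urlparse, parse_qs      # Python 2.7


def split_href(href):
    """Return (path, query string, parsed query parameters) for an href.

    Hypothetical helper for illustration; not part of the original answer.
    """
    parts = urlparse(href)
    # parse_qs maps each parameter name to a list of its values.
    return parts.path, parts.query, parse_qs(parts.query)


# Sample href taken from the question.
path, query, params = split_href("manage/foldercontent.html?folder=Pictures")
folder = params["folder"][0]  # assumes a 'folder' parameter is present
```

With this approach no regular expression is needed: the parser handles the splitting, and a missing `folder` parameter shows up as a `KeyError` rather than a silent mismatch.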
for link in soup.select('a[href*="foldercontent.html?folder"]'):
    print link['href']  # loop body is not shown in the original; printing the href is an assumption
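The selector above can be tried without a network request. The following is a minimal sketch that assumes the `beautifulsoup4` package is installed and uses a small inline HTML snippet: the first anchor is shortened from the question's example, the second is a made-up non-matching link added only to show the filtering.

```python
from bs4 import BeautifulSoup

# Inline test document: the second link is hypothetical and should be skipped.
html = """
<a href="manage/foldercontent.html?folder=Pictures" target="content_window">Pictures</a>
<a href="manage/other.html">Other</a>
"""

soup = BeautifulSoup(html, "html.parser")

# The [href*="..."] attribute selector matches anchors whose href
# contains the given substring, replacing the manual 'if' filter.
matching_hrefs = [
    link["href"]
    for link in soup.select('a[href*="foldercontent.html?folder"]')
]
```

Here the filtering happens inside the selector, so the loop only ever sees the relevant anchors.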



