Accessing HTML elements with BeautifulSoup in Python 2.7
I'm having trouble getting the href attribute values of all the links in a list. I'm not sure what I'm doing wrong; I can't even get at the hrefs. Here is a small part of what I'm trying to parse:
<!-- <div class="container">
<div class="row">
<div class="col-xs-12 col-md-offset-2 col-md-8 col-md-offset-2">
<div id='location_list'><h2>Browse by location</h2><ol class='suburb_locations'><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a><br><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a><br><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span>
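The problem is the opening "<!--" in the markup.
That explains why I'm getting nothing back. I'm reading it straight from the website and it prints like this. Are you saying I should clean it up that way?
Can you share the link?
Yes, it is completely commented out in the source, so bs4 ignores it.
When you do the replace you get "AttributeError: addinfourl instance has no attribute 'replace'". How do you suggest I get around that?
@FancyDolphin, I added how to do it.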
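Both points from the comments can be reproduced offline. The sketch below is only an illustration under stated assumptions: the `html` string is a trimmed-down, hypothetical stand-in for the real page, `io.BytesIO` stands in for a file-like response object (no network call is made), and `html.parser` is used so that no extra parser has to be installed. It also strips the closing "-->", which goes slightly beyond the one-line replace discussed here.

```python
import io

from bs4 import BeautifulSoup

# Hypothetical, trimmed-down stand-in for the page: the list sits inside an HTML comment.
html = (
    '<html><body><!-- <ol class="suburb_locations">'
    '<li class="col-sm-3">'
    '<a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a>'
    '</li></ol> --></body></html>'
)

# Parsed as-is, the commented block is one Comment node, so no elements match.
commented_out = BeautifulSoup(html, "html.parser").select(".col-sm-3")
print(len(commented_out))

# Stand-in for a file-like response object (an assumption, not the real response).
response = io.BytesIO(html.encode("utf-8"))
try:
    response.replace("<!--", "")
    error_seen = False
except AttributeError:
    error_seen = True  # file-like objects have no .replace method

# Read the body into a string first, then strip the comment markers.
body = response.read().decode("utf-8")
cleaned = body.replace("<!--", "").replace("-->", "")
visible = BeautifulSoup(cleaned, "html.parser").select(".col-sm-3")
print(len(visible))
```

The first print should show 0 matches and the second 1: the elements only become selectable once the comment markers are gone, and the replace only works on a string, not on the response object itself.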
from bs4 import BeautifulSoup
soup = BeautifulSoup(html, "html5lib")
print soup.find_all('br')
print soup.find_all('div h2 ol li')
print soup.find('li', {'class': "col-sm-3"})
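Separately from the comment issue, it may help to see how these calls behave once the markup is visible to the parser. `find_all` takes a tag name (plus optional filters), not a CSS selector, so `find_all('div h2 ol li')` looks for a tag literally named "div h2 ol li". CSS selectors go through `select`, and even there "div h2 ol li" would not match, because the `ol` is a sibling of the `h2` rather than a descendant. A minimal sketch, assuming a cut-down copy of the snippet with the comment marker already removed and using `html.parser`:

```python
from bs4 import BeautifulSoup

# Cut-down, uncommented copy of the snippet from the question (an assumption for illustration).
html = (
    "<div id='location_list'><h2>Browse by location</h2>"
    "<ol class='suburb_locations'>"
    '<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a></li>'
    '<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a></li>'
    "</ol></div>"
)
soup = BeautifulSoup(html, "html.parser")

# find_all treats the string as a single tag name, so nothing matches.
by_bad_name = soup.find_all("div h2 ol li")

# find_all with a tag name and a class filter finds both list items.
by_name = soup.find_all("li", class_="col-sm-3")

# select accepts CSS selectors; this one follows the real nesting (div > ol > li).
by_css = soup.select("div#location_list ol li")

# As a CSS selector the original string still fails: ol is not inside h2.
by_bad_css = soup.select("div h2 ol li")

print(len(by_bad_name))
print(len(by_name))
print(len(by_css))
print(len(by_bad_css))
```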
In [2]: from bs4 import BeautifulSoup
In [3]: soup = BeautifulSoup(html,"lxml")
In [4]: print(soup)
In [5]: soup = BeautifulSoup(html.replace("<!--",""),"lxml")
In [6]: print(soup)
<html><body><div class="container">
<div class="row">
<div class="col-xs-12 col-md-offset-2 col-md-8 col-md-offset-2">
<div id="location_list"><h2>Browse by location</h2><ol class="suburb_locations"><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li></div></ol></div></div></div></div></body></html>
In [6]: soup.select(".col-sm-3")
Out[6]:
[<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li>,
<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li>]
In [7]: soup.select(".col-sm-3")[0].text
Out[7]: u'abbotsford, NSW0 active owners0 active borrowers'
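Since the question asks for the href values rather than the text, the same selection can be taken one step further. This is a hedged sketch, not part of the session above: it assumes a shortened copy of the uncommented markup and uses `html.parser`; the selector `li.col-sm-3 a[href]` is one reasonable choice among several.

```python
from bs4 import BeautifulSoup

# Shortened copy of the uncommented markup (an assumption for illustration).
html = (
    '<ol class="suburb_locations">'
    '<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a>'
    '<br/><span class="sub_title">0 active owners</span></li>'
    '<li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a>'
    '<br/><span class="sub_title">0 active owners</span></li>'
    "</ol>"
)
soup = BeautifulSoup(html, "html.parser")

# Select every anchor that has an href inside the list items.
anchors = soup.select("li.col-sm-3 a[href]")

# Indexing a tag like a dict returns the attribute value.
hrefs = [a["href"] for a in anchors]
names = [a.get_text() for a in anchors]

for name, href in zip(names, hrefs):
    print(name + " -> " + href)
```

Using `a["href"]` raises a KeyError if the attribute is missing, which is why the selector restricts the match to anchors that carry one; `a.get("href")` would return None instead.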
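import requests
from bs4 import BeautifulSoup

# r.content is the response body as a string (Python 2), so .replace works on it
r = requests.get("http://www.carnextdoor.com.au/find-a-car/")
# drop the comment opener so the markup is parsed as elements
soup = BeautifulSoup(r.content.replace("<!--", ""), "lxml")
print(soup.select("div #location_list"))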
[<div id="location_list"><h2>Browse by location</h2><ol class="suburb_locations"><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/abbotsford-nsw">abbotsford, NSW</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/abbotsford-vic">abbotsford, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">0 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/aberfeldie">aberfeldie, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/sa/adelaide">adelaide, SA</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li></div><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/act/ainslie">ainslie, ACT</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/aireys-inlet">aireys inlet, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/airly">airly, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/airport-west">airport west, VIC</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li></div><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/vic/albert-park">albert park, VIC</a><br/><span class="sub_title">0 active owners</span><span 
class="sub_title">5 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/sa/aldgate">aldgate, SA</a><br/><span class="sub_title">1 active owner</span><span class="sub_title">2 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/alexandria">alexandria, NSW</a><br/><span class="sub_title">1 active owner</span><span class="sub_title">53 active borrowers</span></li><li class="col-sm-3"><a href="http://www.carnextdoor.com.au/car-rental/nsw/alexandria-mc">alexandria mc, NSW</a><br/><span class="sub_title">0 active owners</span><span class="sub_title">1 active borrower</span></li></div><div class="row"><li class="col-sm-3"><a href="http://www.carnextdoor.com