Python BeautifulSoup: scraping span class="value"
I am trying to use Python to extract information from a page. The markup on the page I am scraping looks like this:
<li class="item">
<span class="name">Sara</span>
<span class="value">selling potato in town</span>
</li>
<li class="item">
<span class="name">Grouping</span>
<span class="value">clothes</span>
</li>
<li class="item">
<span class="name">Phone</span>
<span class="value">
04142018071 09128983727
</span>
</li>
Try this:
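from simplified_scrapy.simplified_doc import SimplifiedDoc

# html holds the page markup shown in the question
doc = SimplifiedDoc(html)
# Select every <li> whose class is "item"
lst = doc.getElements(tag='li', value='item')
for item in lst:
    # Each child is one of the <span> elements inside the <li>
    for child in item.getChildren():
        print('%s=%s' % (child['class'], child.text))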
We need to see the code you are working with. What method are you using to get this data? What language? What problem are you running into?
from bs4 import BeautifulSoup

html_doc = """
<li class="item">
<span class="name">Sara</span>
<span class="value">selling potato in town</span>
</li>
"""

soup = BeautifulSoup(html_doc, 'html.parser')
# find() returns only the first matching <li>
content = soup.find("li", {"class": "item"})
name = content.find("span", {"class": "name"}).get_text()
value = content.find("span", {"class": "value"}).get_text()
print(name)
print(value)
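
The snippet above prints only the first item, because `find` returns a single match. To get every name/value pair from the markup in the question, one option is to loop over `find_all` instead. The following is a minimal sketch, not the only way to do it: `extract_items` is a hypothetical helper name, and it assumes each `li.item` holds one `span.name` and one `span.value`, as in the sample.

```python
from bs4 import BeautifulSoup

html_doc = """
<li class="item">
<span class="name">Sara</span>
<span class="value">selling potato in town</span>
</li>
<li class="item">
<span class="name">Grouping</span>
<span class="value">clothes</span>
</li>
<li class="item">
<span class="name">Phone</span>
<span class="value">
04142018071 09128983727
</span>
</li>
"""


def extract_items(markup):
    """Return a dict mapping each span.name text to its span.value text.

    Hypothetical helper for illustration; assumes the structure shown
    in the question (one name span and one value span per li.item).
    """
    soup = BeautifulSoup(markup, "html.parser")
    items = {}
    # find_all returns every matching <li>, not just the first one
    for li in soup.find_all("li", class_="item"):
        name = li.find("span", class_="name")
        value = li.find("span", class_="value")
        if name is None or value is None:
            continue  # skip items that do not match the assumed layout
        # split/join collapses the newlines around multi-line values
        items[name.get_text(strip=True)] = " ".join(value.get_text().split())
    return items


items = extract_items(html_doc)
for name, value in items.items():
    print(f"{name}: {value}")
```

The whitespace is collapsed with `split` and `join` because the Phone value spans several lines in the source markup; drop that step if you want the text exactly as it appears.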