Python 3.x 从<;正确获取数据;span>;使用beautifulsoup、python标记
我爬一个观光和活动页面,以获得该页面中提到的价格。以下方框中提到了价格:Python 3.x 从<;正确获取数据;span>;使用beautifulsoup、python标记,python-3.x,beautifulsoup,request,web-crawler,Python 3.x,Beautifulsoup,Request,Web Crawler,我爬一个观光和活动页面,以获得该页面中提到的价格。以下方框中提到了价格: <div class="price-info" data-origin-price="1200" data-lowest-price="1200.0" data-origin-ccy="JPY" data-discount-percentage="60"> <span class="before-discount-row"> <span class="before
<div class="price-info" data-origin-price="1200" data-lowest-price="1200.0" data-origin-ccy="JPY" data-discount-percentage="60">
<span class="before-discount-row">
<span class="before-discount">25.12</span>
<span class="currency">EUR</span>
</span>
<span class="price-row">
<span class="price-prefix">From</span>
<span class="price">10.05</span>
<span class="currency">EUR</span>
</span>
这是我得到的产出,而不是10.05欧元
[<span class="price"></span>]
[]
有人能帮我吗?有没有办法把数字从范围中去掉
谢谢您的帮助:)我想您忘记了最后一个for循环:
g_data = soup.find_all("li", {"class": "activity-list"})
for item in g_data:
prices = item.find_all("div", {"class": "price-info"})
for t in prices:
final_prices = item.find_all("span", {"class": "price"})
for p in final_prices:
print(p)
内容是动态的,这就是问题所在。@ElvirMuslic有什么办法解决这个问题吗?是的,使用“假/自动/模拟浏览器”Selenium,在youtube上查找。不幸的是,它不起作用,仍然得到相同的输出。
[<span class="price"></span>]
g_data = soup.find_all("li", {"class": "activity-list"})
for item in g_data:
prices = item.find_all("div", {"class": "price-info"})
for t in prices:
final_prices = item.find_all("span", {"class": "price"})
for p in final_prices:
print(p)