Python BeautifulSoup: How do I extract a tag value?

python web-crawler

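I'm new to programming and can't seem to solve the following data extraction problem.

This is my data (yellow = what I'm trying to extract):

Extracting the title, price, and time works fine: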

# Title
advertTitle = firstAdvert.find_all(
"section", {"class": "aditem-main"})[0].find("h2").text.encode("utf-8").strip().replace("\n", "")

# Price
advertPrice = firstAdvert.find_all(
"section", {"class": "aditem-details"})[0].find("strong").text.encode("utf-8").strip().replace("\n", "")

# Time
advertTimeAdded = firstAdvert.find_all(
"section", {"class": "aditem-addon"})[0].text.encode("utf-8").strip().replace("\n", "")

But my main problem is: how do I extract "79924470" from this:

<article class="aditem" data-adid="79924470">

Thanks for pointing me in the right direction.

You can do it like this:

data = []
# attrs={'data-adid': True} matches every tag that has a data-adid attribute
for element in soup.find_all(attrs={'data-adid': True}):
    data.append(element['data-adid'])

This should add each data-adid value to the list data.

Since you are using BeautifulSoup, you can extract the value of an attribute like this:
from bs4 import BeautifulSoup

soup = BeautifulSoup(file, "lxml")
print(soup.article['data-adid'])  # output: 79924470
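
Because the ID differs from advert to advert, it may help to collect every data-adid on the page rather than match one known value. The following is a minimal sketch, not a definitive implementation: the sample markup and the second ID (79924471) are made up for illustration, and only the first <article> line comes from the question. It uses the built-in "html.parser" so that it runs without lxml installed.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; only the first <article> line is from the question.
html = """
<article class="aditem" data-adid="79924470"></article>
<article class="aditem" data-adid="79924471"></article>
<article class="aditem"></article>
"""

soup = BeautifulSoup(html, "html.parser")

# CSS attribute selector: only <article class="aditem"> tags that carry data-adid.
ad_ids = [tag["data-adid"] for tag in soup.select("article.aditem[data-adid]")]
print(ad_ids)

# tag.get() returns None instead of raising KeyError when the attribute is missing.
all_ids = [tag.get("data-adid") for tag in soup.find_all("article", class_="aditem")]
print(all_ids)
```

Indexing with tag["data-adid"] raises KeyError on a tag without the attribute, so tag.get() is the safer choice when some adverts might lack an ID.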

You can use a series of selects to get the various elements:
from bs4 import BeautifulSoup

soup = BeautifulSoup(html, "lxml")
print(soup.article['data-adid'])
image = soup.select('div.imagebox.srpimagebox')[0]
print(image['data-href'])
print(image['data-imgsrc'])
print(soup.select('section h2 a')[0].text)
print(', '.join([v.strip() for v in soup.select('section.aditem-details')[0].text.strip().split('\n')]))
print(soup.select('section.aditem-addon')[0].get_text(strip=True))

This displays:
79924470
/ref/79924470
https://imgserver.com/012004.JPG
I am a title
12.380€, 50111, Cityname, 25km
Today, 16:19
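
When a page holds many adverts, the same selectors can be applied per <article> and the results gathered into one dict each. This is a rough sketch under stated assumptions: the markup below is reconstructed from the selectors and output shown above, so the real page may differ, and parse_advert is a hypothetical helper name, not part of BeautifulSoup.

```python
from bs4 import BeautifulSoup

# Assumed markup, reconstructed from the selectors and output above.
html = """
<article class="aditem" data-adid="79924470">
  <div class="imagebox srpimagebox" data-href="/ref/79924470"
       data-imgsrc="https://imgserver.com/012004.JPG"></div>
  <section class="aditem-main"><h2><a href="/ref/79924470">I am a title</a></h2></section>
  <section class="aditem-details"><strong>12.380€</strong></section>
  <section class="aditem-addon"> Today, 16:19 </section>
</article>
"""


def parse_advert(article):
    """Collect the fields of one <article class="aditem"> into a dict (hypothetical helper)."""
    image = article.select_one("div.imagebox.srpimagebox")
    title = article.select_one("section.aditem-main h2")
    price = article.select_one("section.aditem-details strong")
    added = article.select_one("section.aditem-addon")
    return {
        "id": article.get("data-adid"),
        "link": image.get("data-href") if image else None,
        "image": image.get("data-imgsrc") if image else None,
        # get_text(strip=True) trims surrounding whitespace and newlines
        "title": title.get_text(strip=True) if title else None,
        "price": price.get_text(strip=True) if price else None,
        "added": added.get_text(strip=True) if added else None,
    }


soup = BeautifulSoup(html, "html.parser")
adverts = [parse_advert(a) for a in soup.select("article.aditem")]
print(adverts[0]["id"], adverts[0]["title"])
```

Searching from each article rather than from soup keeps the fields of different adverts from mixing, and select_one returns None for a missing element instead of raising IndexError the way [0] would.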

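Please don't post code in images. Some people may not be able to open them and so can't help. -1

I should have explained it better. The number "79924470" is different every time. So I have a list containing "" and I'm trying to extract the number from it.

Wow, this is so helpful! Thank you very much, Martin.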