Python: get a list of numbers
Tags: python, beautifulsoup
I used Beautiful Soup to select elements by class:
soup.select('.pr-xs')
which results in
[<span class="instructor-block__students-subscribed pl-xs pr-xs">
1,184,500 students
</span>, <span class="instructor-block__students-subscribed pl-xs pr-xs">
697,000 students
</span>, <span class="instructor-block__students-subscribed pl-xs pr-xs">
167,500 students
</span>, <span class="instructor-block__students-subscribed pl-xs pr-xs">
145,500 students
</span>, <span class="instructor-block__students-subscribed pl-xs pr-xs">
81,000 students
</span>, <span class="instructor-block__students-subscribed pl-xs pr-xs">
172,000 students
</span>]
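You can use the re module to find the numbers:
import re
# Take the first run of digits and commas in each element, then strip the commas
numbers = [re.sub(',', '', re.findall(r'[\d,]+', str(i))[0]) for i in soup.select('.pr-xs')]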
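With the help of regex functions:
import re
# Raw string so that \d is passed to the regex engine unchanged
students = [re.sub(',', '', re.findall(r'[\d,]+', str(i))[0]) for i in soup.select('.pr-xs')]
This gives you the following result: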
['1184500','697000','167500','145500','81000','172000']
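The regex answers above return strings and raise an IndexError if an element contains no digits. A minimal sketch of a more defensive variant follows, assuming you want integers. The helper name `parse_count` and the hard-coded `texts` list are illustrative only: `texts` stands in for the text of the elements returned by `soup.select('.pr-xs')`, with values copied from the question.

```python
import re

# Stand-in for [tag.get_text() for tag in soup.select('.pr-xs')];
# the values are copied from the question, not fetched from a page.
texts = ["\n1,184,500 students\n", "\n697,000 students\n", "\n81,000 students\n"]

def parse_count(text):
    """Return the first comma-grouped number in text as an int, or None if absent."""
    match = re.search(r"\d[\d,]*", text)
    if match is None:
        return None
    # Drop the thousands separators before converting
    return int(match.group().replace(",", ""))

counts = [parse_count(t) for t in texts]
print(counts)  # [1184500, 697000, 81000]
```

Returning None for elements without a number is one possible choice; filtering such elements out would work as well.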
Try this to get the result:
from bs4 import BeautifulSoup
soup = BeautifulSoup(content,"lxml")
data = [item.text.split("students")[0].strip() for item in soup.select('.pr-xs')]
print(data)
Output:
['1,184,500', '697,000', '167,500', '145,500', '81,000', '172,000']
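The output above is still a list of strings with thousands separators. If actual numbers are needed, one possible follow-up step (a sketch, with `data` hard-coded to the output shown above rather than scraped) is to remove the commas and convert each item with `int`:

```python
# data is copied from the output above; in practice it would come from the scraping step.
data = ['1,184,500', '697,000', '167,500', '145,500', '81,000', '172,000']

# Remove the thousands separators, then convert each string to an int
numbers = [int(s.replace(",", "")) for s in data]
print(numbers)  # [1184500, 697000, 167500, 145500, 81000, 172000]
```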
Comment: Please show what you have tried, or the problem you ran into while trying.