Python: get values from button tags


I am trying to extract the pagination from a page using BeautifulSoup.

I managed to get the buttons with

soup.findAll('button', class_='SomeName')
which gives me this:

<button class="SomeName" data-page="2" type="button">2</button>, <button class="SomeName" data-page="3" type="button">3</button>, <button class="SomeName" data-page="4" type="button">4</button>, <button class="SomeName" data-page="5" type="button">5</button>, <button class="SomeName" data-page="6" type="button">6</button>, <button class="SomeName" data-page="7" type="button">7-12</button>

You can get the text between the button tags:

from bs4 import BeautifulSoup
html = '<button class="SomeName" data-page="2" type="button">2</button>, <button class="SomeName" data-page="3" type="button">3</button>, <button class="SomeName" data-page="4" type="button">4</button>, <button class="SomeName" data-page="5" type="button">5</button>, <button class="SomeName" data-page="6" type="button">6</button>, <button class="SomeName" data-page="7" type="button">7-12</button>'
# Parse the markup, then collect the visible text of every <button>
soup = BeautifulSoup(html, 'html.parser')
result = [button.text for button in soup.find_all('button')]
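If the page number is wanted rather than the label, the data-page attribute may be the more reliable source, since the last button reads "7-12" but carries data-page="7". The following is a minimal sketch under that assumption; the shortened markup and the names parsed, labels and pages are illustrative and not from the original post.

```python
from bs4 import BeautifulSoup

# Shortened sample of the markup from the question (illustrative only)
html = (
    '<button class="SomeName" data-page="2" type="button">2</button>'
    '<button class="SomeName" data-page="3" type="button">3</button>'
    '<button class="SomeName" data-page="7" type="button">7-12</button>'
)

parsed = BeautifulSoup(html, "html.parser")
buttons = parsed.find_all("button", class_="SomeName")

# Visible label of each button
labels = [b.get_text(strip=True) for b in buttons]

# Value of the data-page attribute, skipping buttons that lack it
pages = [int(b["data-page"]) for b in buttons if b.has_attr("data-page")]

print(labels)  # ['2', '3', '7-12']
print(pages)   # [2, 3, 7]
```

Indexing a tag like a dictionary raises KeyError when the attribute is missing, which is why the sketch filters with has_attr first; b.get("data-page") would return None instead.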

You can also use a CSS selector:

output = [button.text for button in soup.select('button.SomeName')]
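As a self-contained illustration of the selector approach, the sketch below assumes soup is a parsed BeautifulSoup object (named page here to keep it distinct). The sample markup, including the "Other" and "Next" buttons, is made up for the example. It also shows an attribute selector that keeps only buttons carrying data-page.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: one matching button, one with a different class,
# and one matching button without a data-page attribute
html = (
    '<button class="SomeName" data-page="2" type="button">2</button>'
    '<button class="Other" data-page="9" type="button">9</button>'
    '<button class="SomeName" type="button">Next</button>'
)

page = BeautifulSoup(html, "html.parser")

# Every <button> whose class list contains SomeName
all_named = [b.text for b in page.select("button.SomeName")]

# Only those that also have a data-page attribute
with_page = [b["data-page"] for b in page.select("button.SomeName[data-page]")]

print(all_named)  # ['2', 'Next']
print(with_page)  # ['2']
```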

Exactly what I needed. data = [i.text for i in soup.findAll('button', class_='SomeName')] gives:
['2', '3', '4', '5', '6', '7-12']