Python: getting values from button tags
Tags: python, button, beautifulsoup

I'm trying to use BeautifulSoup to extract the pagination from a page. I managed to grab the buttons with:
soup.findAll('button', class_='SomeName')
which gives me this:
<button class="SomeName" data-page="2" type="button">2</button>, <button class="SomeName" data-page="3" type="button">3</button>, <button class="SomeName" data-page="4" type="button">4</button>, <button class="SomeName" data-page="5" type="button">5</button>, <button class="SomeName" data-page="6" type="button">6</button>, <button class="SomeName" data-page="7" type="button">7-12</button>
You can get the text between the button tags:
from bs4 import BeautifulSoup as soup
html = '<button class="SomeName" data-page="2" type="button">2</button>, <button class="SomeName" data-page="3" type="button">3</button>, <button class="SomeName" data-page="4" type="button">4</button>, <button class="SomeName" data-page="5" type="button">5</button>, <button class="SomeName" data-page="6" type="button">6</button>, <button class="SomeName" data-page="7" type="button">7-12</button>'
result = [i.text for i in soup(html, 'html.parser').find_all('button')]
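If the page number itself is what you need rather than the visible label (the last button shows "7-12" but carries data-page="7"), the attribute can be read directly. This is a minimal sketch on a shortened copy of the markup above; the names `parsed` and `pages` are only illustrative:

```python
from bs4 import BeautifulSoup

# Shortened version of the markup from the question.
html = (
    '<button class="SomeName" data-page="2" type="button">2</button>'
    '<button class="SomeName" data-page="7" type="button">7-12</button>'
)
parsed = BeautifulSoup(html, "html.parser")

# Tag.get returns None instead of raising KeyError if the attribute is missing.
pages = [
    button.get("data-page")
    for button in parsed.find_all("button", class_="SomeName")
]
print(pages)  # ['2', '7']
```

The values come back as strings, so convert them with int() if you want to do arithmetic on them.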
You can also use a CSS selector:
output = [button.text for button in soup.select('button.SomeName')]
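Note that in the line above `soup` has to be a parsed document, not the BeautifulSoup class alias used in the other answer. A self-contained sketch, assuming a made-up extra button with class "Other" to show that the selector filters by class (`doc` and `labels` are illustrative names):

```python
from bs4 import BeautifulSoup

# Sample markup; the "Other" button is a hypothetical addition for illustration.
html = (
    '<button class="SomeName" data-page="2" type="button">2</button>'
    '<button class="Other" type="button">Next</button>'
    '<button class="SomeName" data-page="3" type="button">3</button>'
)
doc = BeautifulSoup(html, "html.parser")

# "button.SomeName" matches only <button> elements that have the class SomeName.
labels = [button.text for button in doc.select("button.SomeName")]
print(labels)  # ['2', '3']
```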
Exactly what I needed: data = [i.text for i in soup.findAll('button', class_='SomeName')]
['2', '3', '4', '5', '6', '7-12']