Python 如何使用Beautiful Soup在一个查询中获得多个类
我想在下面的html中找到class=“s”或class=“sb”的tdPython 如何使用Beautiful Soup在一个查询中获得多个类,python,html-parsing,beautifulsoup,Python,Html Parsing,Beautifulsoup,我想在下面的html中找到class=“s”或class=“sb”的td <tr bgcolor="#e5e5f3"><td class="sb" width="200" align="left">test1</td><td class="sb" align="right">5,774.0</td><td class="sb" align="right">4,481.0</td><td class="sb"
<tr bgcolor="#e5e5f3"><td class="sb" width="200" align="left">test1</td><td class="sb" align="right">5,774.0</td><td class="sb" align="right">4,481.0</td><td class="sb" align="right">5,444.0</td><td class="sb" align="right">6,615.0</td><td class="sb" align="right">6,858.0</td></tr>
<tr bgcolor="#f0f0E7"><td class="s" width="200" align="left">test2</td><td class="s" align="right">5,774.0</td><td class="s" align="right">4,481.0</td><td class="s" align="right">5,444.0</td><td class="s" align="right">6,615.0</td><td class="s" align="right">6,858.0</td></tr>
您可以使用Beauty soup对正则表达式的支持来实现这一点
import re
soup = BeautifulSoup(urllib2.urlopen(url).read(),"lxml");
for item in soup.find_all("td", { "class" : re.compile(r"^(s|sb)$") })
此正则表达式匹配:
-字符串的开头^
-字符串(s|sb)
或字符串's'
'sb'
-字符串的结尾$
import re
soup = BeautifulSoup(urllib2.urlopen(url).read(),"lxml");
for item in soup.find_all("td", { "class" : re.compile(r"^(s|sb)$") })