Python 仅在H2内刮取内容物-美化组
我有这个标记Python 仅在H2内刮取内容物-美化组,python,python-3.x,beautifulsoup,Python,Python 3.x,Beautifulsoup,我有这个标记 <h2> Virtual Office packages <span>From</span><span class="cost">$74.97</span> </h2> 我只想获得虚拟办公软件包而不是价格信息 我可以在第二种情况下得到它,但是如果HTML像第一种一样被找到,并且有价格信息,我会得到None 确切地说,我如何仅获取带有任一类型HTMLs的“虚拟办公软件包” soup = Beautifu
<h2>
Virtual Office packages
<span>From</span><span class="cost">$74.97</span>
</h2>
我只想获得虚拟办公软件包
而不是价格信息
我可以在第二种情况下得到它,但是如果HTML像第一种一样被找到,并且有价格信息,我会得到None
确切地说,我如何仅获取带有任一类型HTMLs的“虚拟办公软件包”
soup = BeautifulSoup("""
<h2>
Virtual Office packages
<span>From</span><span class="cost">$74.97</span>
</h2>""")
print(soup.find("h2").next_element.strip())
Virtual Office packages
soup = BeautifulSoup("""
<h2>Virtual Office packages</h2>
""")
print(soup.find("h2").next_element)
Virtual Office packages
soup = BeautifulSoup("""
<h2>
Virtual Office packages
<span>From</span><span class="cost">$74.97</span>
</h2>""")
print(soup.find("h2").contents[0].strip())
Virtual Office packages
soup = BeautifulSoup("""
<h2>Virtual Office packages</h2>
""")
print soup.find("h2").contents[0]
Virtual Office packages
虚拟办公软件包
从74.97美元起
""")
打印(soup.find(“h2”).next_element.strip())
虚拟办公软件包
汤=美汤
虚拟办公软件包
""")
打印(soup.find(“h2”)。下一个元素)
虚拟办公软件包
汤=美汤
虚拟办公软件包
从74.97美元起
""")
打印(soup.find(“h2”).contents[0].strip())
虚拟办公软件包
汤=美汤
虚拟办公软件包
""")
打印汤。查找(“h2”)。内容[0]
虚拟办公软件包
service_header = service.select("h2")[0].string
soup = BeautifulSoup("""
<h2>
Virtual Office packages
<span>From</span><span class="cost">$74.97</span>
</h2>""")
print(soup.find("h2").next_element.strip())
Virtual Office packages
soup = BeautifulSoup("""
<h2>Virtual Office packages</h2>
""")
print(soup.find("h2").next_element)
Virtual Office packages
soup = BeautifulSoup("""
<h2>
Virtual Office packages
<span>From</span><span class="cost">$74.97</span>
</h2>""")
print(soup.find("h2").contents[0].strip())
Virtual Office packages
soup = BeautifulSoup("""
<h2>Virtual Office packages</h2>
""")
print soup.find("h2").contents[0]
Virtual Office packages