Python 如何获取第一个`<;李>;`从`<;ul>;`与美联
如下图所示Python 如何获取第一个`<;李>;`从`<;ul>;`与美联,python,html,beautifulsoup,Python,Html,Beautifulsoup,如下图所示 <div class="carousel"> <div class="carousel_Wrapper"> <div class="carousel_Container swiper-container"> <ul class="swiper-wrapper"> <li class="swiper-slide"> <figure><img
<div class="carousel">
<div class="carousel_Wrapper">
<div class="carousel_Container swiper-container">
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
</li>
</ul>
</div>
<div class="carousel_NextBtn"></div>
<div class="carousel_PrevBtn"></div>
</div>
</div>
<div class="carousel">
<div class="carousel_Wrapper">
<div class="carousel_Container swiper-container">
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
</li>
</ul>
</div>
<div class="carousel_NextBtn"></div>
<div class="carousel_PrevBtn"></div>
</div>
</div>
当应用上述处理时,我认为将生成如下所示的html
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
</li>
</ul>
<div class="carousel_PrevBtn"></div>
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
</li>
</ul>
<div class="carousel_PrevBtn"></div>
-
-
-
-
-
-
我们认为必要的处理如下
- 1.检索每个
- 2.插入
对于2,我认为更换没有问题。
但是我不知道如何实现1。
请说明解决方法
html=“”
-
-
-
-
-
-
"""
从bs4导入BeautifulSoup
soup=BeautifulSoup(html,'html.parser')
all_div=soup.find_all('ul',{'class':'swiper wrapper'})#查找具有指定类的所有ul标记
对于所有分区中的标记:
打印('--------------迭代:'+str(所有分区索引(标记))+'--------------)
print(tag.find('li',{'class':'swiper slide'}))#此方法仅在项目具有类时有效
print(tag.contents[1])#如果项目没有类,此方法也可以使用
“检索每个
的第一个
元素的内容”的解决方案可以实现,如上面的代码所示。第二个问题你没有遇到任何问题,所以我还没有发布。如果你在这方面需要任何帮助,请告诉我
html = # First mentioned html
content = BeautifulSoup(html)
content.find('div', class_='carousel_NextBtn').decompose()
content.find('div', class_='carousel').unwrap()
content.find('div', class_='carousel_Wrapper').unwrap()
content.find('div', class_='carousel_Container swiper-container').unwrap()
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
</li>
</ul>
<div class="carousel_PrevBtn"></div>
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
</li>
</ul>
<div class="carousel_PrevBtn"></div>
html = """<div class="carousel">
<div class="carousel_Wrapper">
<div class="carousel_Container swiper-container">
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
</li>
</ul>
</div>
<div class="carousel_NextBtn"></div>
<div class="carousel_PrevBtn"></div>
</div>
</div>
<div class="carousel">
<div class="carousel_Wrapper">
<div class="carousel_Container swiper-container">
<ul class="swiper-wrapper">
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
</li>
<li class="swiper-slide">
<figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
</li>
</ul>
</div>
<div class="carousel_NextBtn"></div>
<div class="carousel_PrevBtn"></div>
</div>
</div>"""
from bs4 import BeautifulSoup
soup = BeautifulSoup(html, 'html.parser')
all_div = soup.find_all('ul', {'class': 'swiper-wrapper'}) # find all ul tag with specified class
for tag in all_div:
print('-------------------- iteration : ' + str(all_div.index(tag)) + ' --------------------')
print(tag.find('li', {'class': 'swiper-slide'})) # this method works only if your item has class
print(tag.contents[1]) # this method will also work if your item don't have a class