Python 如何获取第一个`<;李>;`从`<;ul>;`与美联

Python 如何获取第一个`<;李>;`从`<;ul>;`与美联,python,html,beautifulsoup,Python,Html,Beautifulsoup,如下图所示 <div class="carousel"> <div class="carousel_Wrapper"> <div class="carousel_Container swiper-container"> <ul class="swiper-wrapper"> <li class="swiper-slide"> <figure><img

如下图所示

<div class="carousel"> 
  <div class="carousel_Wrapper"> 
    <div class="carousel_Container swiper-container"> 
      <ul class="swiper-wrapper">
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
        </li>
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
        </li>
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
        </li>
      </ul>
    </div>
    <div class="carousel_NextBtn"></div> 
    <div class="carousel_PrevBtn"></div> 
  </div> 
</div>

<div class="carousel"> 
  <div class="carousel_Wrapper"> 
    <div class="carousel_Container swiper-container"> 
      <ul class="swiper-wrapper">
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
        </li>
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
        </li>
        <li class="swiper-slide"> 
          <figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
        </li>
      </ul>
    </div>
    <div class="carousel_NextBtn"></div> 
    <div class="carousel_PrevBtn"></div> 
  </div> 
</div>
当应用上述处理时,我认为将生成如下所示的html

<ul class="swiper-wrapper">
  <li class="swiper-slide"> 
    <figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
  </li>
  <li class="swiper-slide"> 
    <figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
  </li>
  <li class="swiper-slide"> 
    <figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
  </li>
</ul>
<div class="carousel_PrevBtn"></div> 

<ul class="swiper-wrapper">
  <li class="swiper-slide"> 
    <figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
  </li>
    <li class="swiper-slide"> 
  <figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
  </li>
    <li class="swiper-slide"> 
  <figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
  </li>
</ul>
<div class="carousel_PrevBtn"></div> 
我们认为必要的处理如下

  • 1.检索每个

  • 2.插入



    对于2,我认为更换没有问题。
    但是我不知道如何实现1。

    请说明解决方法
html=“”
""" 从bs4导入BeautifulSoup soup=BeautifulSoup(html,'html.parser') all_div=soup.find_all('ul',{'class':'swiper wrapper'})#查找具有指定类的所有ul标记 对于所有分区中的标记: 打印('--------------迭代:'+str(所有分区索引(标记))+'--------------) print(tag.find('li',{'class':'swiper slide'}))#此方法仅在项目具有类时有效 print(tag.contents[1])#如果项目没有类,此方法也可以使用
“检索每个
的第一个
  • 元素的内容”的解决方案可以实现,如上面的代码所示。第二个问题你没有遇到任何问题,所以我还没有发布。如果你在这方面需要任何帮助,请告诉我

    html = # First mentioned html
    
    content = BeautifulSoup(html)
    
    content.find('div', class_='carousel_NextBtn').decompose()
    content.find('div', class_='carousel').unwrap()
    content.find('div', class_='carousel_Wrapper').unwrap()
    content.find('div', class_='carousel_Container swiper-container').unwrap()
    
    <ul class="swiper-wrapper">
      <li class="swiper-slide"> 
        <figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
      </li>
      <li class="swiper-slide"> 
        <figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
      </li>
      <li class="swiper-slide"> 
        <figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
      </li>
    </ul>
    <div class="carousel_PrevBtn"></div> 
    
    <ul class="swiper-wrapper">
      <li class="swiper-slide"> 
        <figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
      </li>
        <li class="swiper-slide"> 
      <figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
      </li>
        <li class="swiper-slide"> 
      <figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
      </li>
    </ul>
    <div class="carousel_PrevBtn"></div> 
    
    html = """<div class="carousel"> 
      <div class="carousel_Wrapper"> 
        <div class="carousel_Container swiper-container"> 
          <ul class="swiper-wrapper">
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0001.jpg"/></figure>
            </li>
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0002.jpg"/></figure>
            </li>
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0003.jpg"/></figure>
            </li>
          </ul>
        </div>
        <div class="carousel_NextBtn"></div> 
        <div class="carousel_PrevBtn"></div> 
      </div> 
    </div>
    
    <div class="carousel"> 
      <div class="carousel_Wrapper"> 
        <div class="carousel_Container swiper-container"> 
          <ul class="swiper-wrapper">
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0004.jpg"/></figure>
            </li>
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0005.jpg"/></figure>
            </li>
            <li class="swiper-slide"> 
              <figure><img alt="" src="https://s3.amazonaws.com/0006.jpg"/></figure>
            </li>
          </ul>
        </div>
        <div class="carousel_NextBtn"></div> 
        <div class="carousel_PrevBtn"></div> 
      </div> 
    </div>"""
    
    from bs4 import BeautifulSoup
    soup = BeautifulSoup(html, 'html.parser')
    all_div = soup.find_all('ul', {'class': 'swiper-wrapper'})  # find all ul tag with specified class
    for tag in all_div:
        print('-------------------- iteration : ' + str(all_div.index(tag)) + ' --------------------')
        print(tag.find('li', {'class': 'swiper-slide'}))  # this method works only if your item has class
        print(tag.contents[1])  # this method will also work if your item don't have a class