python web抓取,提取标记的内部元素

python web抓取,提取标记的内部元素,python,web-scraping,Python,Web Scraping,我想从一个在线购物网站上获取产品和价格,需要帮助提取标签之间的字符串 import bs4 from urllib.request import urlopen from bs4 import BeautifulSoup as soup my_url='https://www.flipkart.com/cameras/mirrorless~type/pr?sid=jek%2Cp31' cl=urlopen(my_url) page_html=cl.read() ps=soup(page_html

我想从一个在线购物网站上获取产品和价格,需要帮助提取标签之间的字符串

import bs4
from urllib.request import urlopen
from bs4 import BeautifulSoup as soup
my_url='https://www.flipkart.com/cameras/mirrorless~type/pr?sid=jek%2Cp31'
cl=urlopen(my_url)
page_html=cl.read()
ps=soup(page_html,'html5lib')
ps1=(ps.prettify())
cn=ps.findAll('div',{'class':'_1-2Iqu row'})
len(cn)                     
cn[0].div.div

#output-"<div class="_3wU53n">Canon M50 Mirrorless Camera Body with Single Lens EF-M 15-45 mm ISSTM</div>
#i need Canon M50 Mirrorless Camera Body with Single Lens EF-M 15-45 mm ISSTM
导入bs4
从urllib.request导入urlopen
从bs4进口美汤作为汤
我的https://www.flipkart.com/cameras/mirrorless~type/pr?sid=jek%2Cp31'
cl=urlopen(我的url)
page_html=cl.read()
ps=soup(页面为html,'html5lib')
ps1=(ps.prettify())
cn=ps.findAll('div',{'class':''u 1-2Iqu row'})
莱恩(中国)
cn[0].div.div
#输出-“佳能M50无镜相机机身,单镜头EF-M 15-45毫米ISSTM
#我需要佳能M50无镜相机机身,单镜头EF-M 15-45毫米ISSTM
将cn=ps.findAll('div',{'class':'u 1-2Iqu row'})替换为cn=ps.findAll('div',{'class':'u 1-2Iqu row'},text=True)

将cn=ps.findAll('div',{'class':'u 1-2Iqu row'})替换为cn=ps.findAll('div',{'class':'u 1-2Iqu row'},text=True)