Getting data inside a noscript tag in Python with BeautifulSoup
I'm trying to retrieve some data from inside a noscript tag. I can get everything in the tag, but I can't figure out how to parse it any further.

HTML output:
<noscript><img alt="Monogram Accessories Key Holders and Bag Charms Key Pouch | Louis Vuitton ®" src="https://us.louisvuitton.com/images/is/image/lv/1/PP_VP_L/louis-vuitton-key-pouch-monogram-key-holders-and-bag-charms--M62650_PM2_Front%20view.jpg"/></noscript>
You need to find the img tag inside the noscript tag, and then read its src attribute:
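    # page_soup is the BeautifulSoup object for the page
    noscript = page_soup.find("noscript")
    if noscript:
        img = noscript.find("img")
        if img:
            # Attributes are read with dictionary-style access
            img_url = img['src']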
Thanks for your answer, it gets me closer to my solution. Unfortunately, img_url comes back as None.

Sorry about that, it should be img['src'].
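Putting the answer and the correction together, a minimal self-contained sketch might look like the following. The helper name noscript_img_src and the trimmed-down markup are illustrative assumptions, not part of the original page, and the built-in "html.parser" is assumed as the parser. The sketch also shows one plausible way to end up with None: attribute-style access such as img.src makes BeautifulSoup look for a child tag named src rather than for the src attribute.

```python
from bs4 import BeautifulSoup

# Trimmed-down markup modelled on the question's HTML output (alt text shortened).
IMG_URL = (
    "https://us.louisvuitton.com/images/is/image/lv/1/PP_VP_L/"
    "louis-vuitton-key-pouch-monogram-key-holders-and-bag-charms--"
    "M62650_PM2_Front%20view.jpg"
)
HTML = f'<html><body><noscript><img alt="Key Pouch" src="{IMG_URL}"/></noscript></body></html>'


def noscript_img_src(markup):
    """Return the src of the first <img> inside the first <noscript>, or None.

    Hypothetical helper for illustration; assumes the built-in html.parser.
    """
    soup = BeautifulSoup(markup, "html.parser")
    noscript = soup.find("noscript")
    if noscript is None:
        return None
    img = noscript.find("img")
    if img is None:
        return None
    # .get() returns None instead of raising KeyError when src is missing.
    return img.get("src")


if __name__ == "__main__":
    print(noscript_img_src(HTML))

    img = BeautifulSoup(HTML, "html.parser").find("noscript").find("img")
    # img.src is shorthand for img.find("src"): it searches for a <src> child tag.
    print(img.src)      # no such child tag exists, so this is None
    print(img["src"])   # dictionary-style access reads the attribute
```

Using img.get("src") instead of img["src"] is a small design choice: it keeps the helper from raising KeyError on an img tag that has no src attribute.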