Python web scaping从javascript端抓取图像
我试着从以下图片中抓取图像:Python web scaping从javascript端抓取图像,python,web,beautifulsoup,Python,Web,Beautifulsoup,我试着从以下图片中抓取图像: 与 matches=soup.find_all('img')) 打印(匹配.text) 我只得到“标题”,但没有链接 我要怎么做才能抓住这个 谢谢 我想应该是这样的: soup.find_all('img').get('src'); 试试这个 from simplified_scrapy import SimplifiedDoc html = '''<img alt="Whoever Blinks First" title="Whoever Blin
与
matches=soup.find_all('img'))
打印(匹配.text)
我只得到“标题”,但没有链接
我要怎么做才能抓住这个
谢谢 我想应该是这样的:
soup.find_all('img').get('src');
试试这个
from simplified_scrapy import SimplifiedDoc
html = '''<img alt="Whoever Blinks First" title="Whoever Blinks First" src="https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb" srcset="https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb 1x, https://secure2-images-tushy.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/tushy-whoever-blinks-first_3840x2160.webp?nvb=20200209184040&nva=20200215184340&hash=072c8e8d26b8a88fd8672 2x" data-test-component="ProgressiveImageImage" class="sc-1egln9q-1 irBaDM">
'''
doc = SimplifiedDoc(html)
imgs = doc.imgs
print (imgs[0])
下面是SimplifiedDoc的一个示例:find_all()返回一个数组,因此您应该使用find_all('img')[0]或just find('img')。matches=soup.find_all('img')。get('src')File“C:\Program Files\Python38\lib\site packages\bs4\element.py”,第2080行,在getattr raise AttributeError中(AttributeError:ResultSet对象没有属性“get”。您可能将元素列表视为单个元素。当您打算调用find()时是否调用了find_all()?matches=soup.find('img')。get('src')print(len(matches))>>0matches=soup.find('img')print(matches)我得到这个:但是src=“”问题是:图像没有加载。
{
'alt': 'Whoever Blinks First',
'title': 'Whoever Blinks First',
'src': 'https: //secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb',
'srcset': 'https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb 1x, https://secure2-images-tushy.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/tushy-whoever-blinks-first_3840x2160.webp?nvb=20200209184040&nva=20200215184340&hash=072c8e8d26b8a88fd8672 2x',
'data-test-component': 'ProgressiveImageImage',
'class': 'sc-1egln9q-1 irBaDM',
'tag': 'img'
}