Python web scaping从javascript端抓取图像

Python web scaping从javascript端抓取图像,python,web,beautifulsoup,Python,Web,Beautifulsoup,我试着从以下图片中抓取图像: 与 matches=soup.find_all('img')) 打印(匹配.text) 我只得到“标题”,但没有链接 我要怎么做才能抓住这个 谢谢 我想应该是这样的: soup.find_all('img').get('src'); 试试这个 from simplified_scrapy import SimplifiedDoc html = '''<img alt="Whoever Blinks First" title="Whoever Blin

我试着从以下图片中抓取图像:


matches=soup.find_all('img'))
打印(匹配.text)
我只得到“标题”,但没有链接 我要怎么做才能抓住这个


谢谢

我想应该是这样的:

soup.find_all('img').get('src'); 
试试这个

from simplified_scrapy import SimplifiedDoc
html = '''<img alt="Whoever Blinks First" title="Whoever Blinks First" src="https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&amp;nva=20200215184340&amp;hash=05599031bb49b7c1583cb" srcset="https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&amp;nva=20200215184340&amp;hash=05599031bb49b7c1583cb 1x, https://secure2-images-tushy.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/tushy-whoever-blinks-first_3840x2160.webp?nvb=20200209184040&amp;nva=20200215184340&amp;hash=072c8e8d26b8a88fd8672 2x" data-test-component="ProgressiveImageImage" class="sc-1egln9q-1 irBaDM">
'''

doc = SimplifiedDoc(html)
imgs = doc.imgs
print (imgs[0])

下面是SimplifiedDoc的一个示例:

find_all()返回一个数组,因此您应该使用find_all('img')[0]或just find('img')。matches=soup.find_all('img')。get('src')File“C:\Program Files\Python38\lib\site packages\bs4\element.py”,第2080行,在getattr raise AttributeError中(AttributeError:ResultSet对象没有属性“get”。您可能将元素列表视为单个元素。当您打算调用find()时是否调用了find_all()?matches=soup.find('img')。get('src')print(len(matches))>>0matches=soup.find('img')print(matches)我得到这个:但是src=“”问题是:图像没有加载。
{
  'alt': 'Whoever Blinks First', 
  'title': 'Whoever Blinks First', 
  'src': 'https: //secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb', 
  'srcset': 'https://secure2-images.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/whoever-blinks-first_1920x1080.webp?nvb=20200209184040&nva=20200215184340&hash=05599031bb49b7c1583cb 1x, https://secure2-images-tushy.ssl-cdn.com/videoimages/101323/mainLandscape/1568806239343/tushy-whoever-blinks-first_3840x2160.webp?nvb=20200209184040&nva=20200215184340&hash=072c8e8d26b8a88fd8672 2x', 
  'data-test-component': 'ProgressiveImageImage', 
  'class': 'sc-1egln9q-1 irBaDM', 
  'tag': 'img'
}