Python: How to extract image URLs from a list of Amazon web pages?
Tags: python, html, beautifulsoup
The list contains about 1,100 Amazon web page URLs. The code works for the first few pages, but after that it throws an error: HTTP Error 503: Service Unavailable
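import re
from urllib.request import urlopen

import pandas as pd
from bs4 import BeautifulSoup

listt = []  # placeholder: the list of web pages (about 1,100 Amazon URLs)
imgss = []  # collected image URLs

for i in range(len(listt)):
    print(i)  # progress indicator
    html = urlopen(listt[i])  # this call raises HTTP Error 503 after the first few pages
    bs = BeautifulSoup(html, 'html.parser')
    # keep only <img> tags whose src matches the pattern
    images = bs.find_all('img', {'src': re.compile('PATTERN.jpg')})
    for image in images:
        imgss.append(image['src'])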
It throws an error? Could you add what kind of error it is?
Try this to access the element attribute:
imgss.append(image.attrs['src'])
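One possible way to cope with the intermittent HTTP Error 503 is to send a browser-like User-Agent header and retry each request with an increasing delay. This is only a minimal sketch: it assumes the 503 responses are caused by requests arriving too quickly, which the question does not confirm, and it may not help if the server rejects automated requests outright. The names fetch_with_retry and HEADERS, and the retries and backoff values, are hypothetical and not part of the original code.

```python
import time
import urllib.error
import urllib.request

# Hypothetical header value; any browser-like User-Agent string could be used.
HEADERS = {"User-Agent": "Mozilla/5.0"}


def fetch_with_retry(url, opener=urllib.request.urlopen, retries=3,
                     backoff=2.0, sleep=time.sleep):
    """Return the response body as bytes, retrying on HTTP 503.

    `opener` and `sleep` are injectable so the function can be
    exercised without network access.
    """
    request = urllib.request.Request(url, headers=HEADERS)
    for attempt in range(retries + 1):
        try:
            with opener(request) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # Re-raise anything that is not a 503, or a 503 on the last attempt.
            if err.code != 503 or attempt == retries:
                raise
            # Wait longer after each failure: backoff, 2*backoff, 4*backoff, ...
            sleep(backoff * (2 ** attempt))
```

In the loop from the question, html = urlopen(listt[i]) would become html = fetch_with_retry(listt[i]); BeautifulSoup accepts the returned bytes directly.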