HTTP错误:403,带漂亮汤的Python
代码如下:HTTP错误:403,带漂亮汤的Python,python,http,urllib,http-error,Python,Http,Urllib,Http Error,代码如下: quote_page = "https://www.timeanddate.com/holidays/fun/" + months [date.month].lower() + "/" + str(date.day) page = urllib2.urlopen(quote_page) soup = BeautifulSoup(page, 'html.parser') event_box = soup.find('article', attrs={"class" : "fixed"}
quote_page = "https://www.timeanddate.com/holidays/fun/" + months [date.month].lower() + "/" + str(date.day)
page = urllib2.urlopen(quote_page)
soup = BeautifulSoup(page, 'html.parser')
event_box = soup.find('article', attrs={"class" : "fixed"})
event_box = event_box.find('h3')
event = event_box.text.strip()
print event
当我检查变量时:
quote_page = https://www.timeanddate.com/holidays/fun/june/8
我试过打印quote_页面,链接在我的浏览器中运行良好,
但当我运行代码并打印“事件”时,我得到了以下结果:
Traceback (most recent call last):
File "main.py", line 252, in <module>
page = urllib2.urlopen(req)
File "/usr/local/lib/python2.7/urllib2.py", line 154, in urlopen
return opener.open(url, data, timeout)
File "/usr/local/lib/python2.7/urllib2.py", line 435, in open
response = meth(req, response)
File "/usr/local/lib/python2.7/urllib2.py", line 548, in http_response
'http', request, response, code, msg, hdrs)
File "/usr/local/lib/python2.7/urllib2.py", line 473, in error
return self._call_chain(*args)
File "/usr/local/lib/python2.7/urllib2.py", line 407, in _call_chain
result = func(*args)
File "/usr/local/lib/python2.7/urllib2.py", line 556, in http_error_default
raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
urllib2.HTTPError: HTTP Error 403: ok
exit status 1
回溯(最近一次呼叫最后一次):
文件“main.py”,第252行,在
page=urlib2.urlopen(请求)
文件“/usr/local/lib/python2.7/urllib2.py”,urlopen中的第154行
返回opener.open(url、数据、超时)
文件“/usr/local/lib/python2.7/urllib2.py”,第435行,打开
响应=方法(请求,响应)
http_响应中的文件“/usr/local/lib/python2.7/urllib2.py”,第548行
“http”、请求、响应、代码、消息、hdrs)
文件“/usr/local/lib/python2.7/urllib2.py”,第473行出错
返回自我。调用链(*args)
文件“/usr/local/lib/python2.7/urllib2.py”,第407行,在调用链中
结果=func(*args)
文件“/usr/local/lib/python2.7/urllib2.py”,第556行,默认为http\u error\u
raise HTTPError(请求获取完整url(),代码,消息,hdrs,fp)
urllib2.HTTPError:HTTP错误403:ok
退出状态1
我想我记得今天早些时候它起作用了,但后来我再次尝试时停止了
有人能帮忙吗?
(我对编码相当陌生)正如堆栈跟踪清楚地解释的那样,您在
urllib2.urlopen
中有一个错误。脚本第二行之后的任何内容都不会执行。检查HTTP错误403的含义。(提示:你一定是被禁止使用该URL。)我能做些什么来获得“不受约束”吗?这可能与我在一段时间内测试过太多次有关吗?你可能因为频繁访问而被列入黑名单。休息一下,改天再试试。如果仍然不起作用,请尝试联系网站管理员。如果他们不合作,你就注定要失败。