Warning when building a web crawler in Python with BeautifulSoup

python web-crawler

I am trying to build a simple web crawler that, for the search keyword "legion", prints the URL of every legion product listed on amazon.in. I am using the following code:

import requests
from bs4 import BeautifulSoup

def legion_spider(max_pages):
    page = 1
    while page <= max_pages:
       url = 'https://www.amazon.in/s?k=legion&qid=1588862016&swrs=82DF79C1243AF6D61651CCAA9F883EC4&ref=sr_pg_'+ str(page)
       source_code = requests.get(url)
       plain_txt = source_code.text
       soup = BeautifulSoup(plain_txt)
       for link in soup.findAll('a',{'class': 'a-size-medium a-color-base a-text-normal'}):
           href = link.get('href')
           print(href)
       page += 1


legion_spider(1)

Running it produces the following warning:

C:\Users\lenovo\AppData\Local\Programs\Python\Python38-32\python.exe "E:/Python Practice/web_crawler.py"
E:/Python Practice/web_crawler.py:10: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

The code that caused this warning is on line 10 of the file E:/Python Practice/web_crawler.py. To get rid of this warning, pass the additional argument 'features="html.parser"' to the BeautifulSoup constructor.

  soup = BeautifulSoup(plain_txt)

Process finished with exit code 0

You are missing the parser! The constructor call should follow this form:

 BeautifulSoup(markup, <parser>)

In the code above, that means replacing BeautifulSoup(plain_txt) with BeautifulSoup(plain_txt, "html.parser"), as the warning itself suggests.
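To make the fix concrete, here is a minimal sketch of passing the parser explicitly. It parses a small inline HTML string instead of fetching amazon.in, so it runs offline. The `SAMPLE_HTML` snippet and the `extract_product_links` helper are assumptions made for illustration; only the class string and the "html.parser" argument come from the question and the warning text.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for a search-results page; this is NOT real Amazon markup.
SAMPLE_HTML = """
<html><body>
  <a class="a-size-medium a-color-base a-text-normal" href="/legion-5/dp/AAA">Legion 5</a>
  <a class="a-size-medium a-color-base a-text-normal" href="/legion-7/dp/BBB">Legion 7</a>
  <a class="nav-link" href="/help">Help</a>
</body></html>
"""


def extract_product_links(html):
    """Return the href of every anchor whose class attribute matches the question's class string."""
    # Naming the parser as the second argument is what avoids the UserWarning.
    soup = BeautifulSoup(html, "html.parser")
    anchors = soup.find_all("a", {"class": "a-size-medium a-color-base a-text-normal"})
    return [a.get("href") for a in anchors]


if __name__ == "__main__":
    for href in extract_product_links(SAMPLE_HTML):
        print(href)
```

In the original crawler, the same change would be applied to the line that builds `soup` from `plain_txt`. Whether the class string still matches the live page is a separate matter that this sketch does not check.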