Python 使用BeautifulSoup清除空结果_Python_Beautifulsoup_Scrapy

Python 使用BeautifulSoup清除空结果

python scrapy

Python 使用BeautifulSoup清除空结果,python,beautifulsoup,scrapy,Python,Beautifulsoup,Scrapy,我正在使用BeautifulSoup抓取数据，但在选择任何标记时都会得到一个空结果，下面是我的代码 # -*- coding: utf-8 -*- import requests import sys from bs4 import BeautifulSoup headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.

我正在使用BeautifulSoup抓取数据，但在选择任何标记时都会得到一个空结果，下面是我的代码

# -*- coding: utf-8 -*-
import requests
import sys
from bs4 import BeautifulSoup

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.76 Safari/537.36'}

boardgame_url = "https://boardgamegeek.com/boardgame/285961/heaven-ale-kegs-more"
page = requests.get(boardgame_url, headers=headers)
soup = BeautifulSoup(page.content.decode('utf-8'), 'html.parser')

product_title = soup.select('h1 a')
print(product_title)

使用

beautifulsou

进行刮削时，某些动态内容不会呈现。你们在浏览器中看到的和你们的刮刀得到的明显不同。（您可以导出page.content并进行比较）。您需要一个不同的模块，如

selenium

或

requesthtml

，可以处理动态内容

如果您想要从网页中获取一些数据，那么可以获取java脚本变量并根据需要提取数据。

您可以查看堆栈溢出以获取进一步帮助

您可以在此图像中查看。看起来是使用JS从API加载动态网站内容。我已经打印了汤，但它与在浏览器中看到的不同。selenium会解决这个问题吗？是的，selenium会解决这个问题，因为它使用浏览器来呈现javascript，然后您可以获得html文档