Python beautifulsoup抓取可见网页文本，但不包含以.html结尾的文件_Python_Beautifulsoup

Python beautifulsoup抓取可见网页文本，但不包含以.html结尾的文件

python

Python beautifulsoup抓取可见网页文本，但不包含以.html结尾的文件,python,beautifulsoup,Python,Beautifulsoup,我喜欢这一页上的答案：但我的页面不是以.html结尾，而是：必须有一个简单的解决办法 Cheers是URL，而不是文件名。去你的网站，下载源代码，它将在html 你的URL有输入错误，应该是此脚本将使用get\u text（）方法打印可打印文本： import requests from bs4 import BeautifulSoup url = 'https://biomagscience.net/' soup = BeautifulSoup(requests.get(url).te

我喜欢这一页上的答案：

但我的页面不是以.html结尾，而是：

必须有一个简单的解决办法

Cheers

是URL，而不是文件名。去你的网站，下载源代码，它将在html

你的URL有输入错误，应该是此脚本将使用

get\u text（）

方法打印可打印文本：

import requests
from bs4 import BeautifulSoup

url = 'https://biomagscience.net/'
soup = BeautifulSoup(requests.get(url).text, 'lxml')

for tag in soup.select('style, script, [style*="display:none"]'):
    tag.extract()

print(soup.get_text(strip=True, separator='\n'))

印刷品：

Best Magnets For Healing | Biomagnetic Therapy Products
The Future of Health & Well-Being —Today!
Advanced Therapy for Vitality, Nerve Regeneration & Pain Relief of Acute/Chronic Injuries & Illness
Acute Injuries
•
Alzheimer’s
•
Arthritis
•
Back Pain
•
Chronic Illness
•
EMF
•
Joint Pain
•
Muscle Pain
Magnet Therapy Articles
•
Products
BiomagScience

...and so on.

该网站上的DNS查找失败。你是说？