Python 我需要创建一个新字符串,内容是一个网站的标题

Python 我需要创建一个新字符串,内容是一个网站的标题,python,web,Python,Web,我认为类似(1)的东西可以工作,但它抛出了一个错误。有什么想法或建议吗 (一) 错误: Traceback (most recent call last): File "python", line 132, in <module> File "src/lxml/etree.pyx", line 3426, in lxml.etree.parse File "src/lxml/parser.pxi", line 1839, in lxml.etree._parseDocum

我认为类似(1)的东西可以工作,但它抛出了一个错误。有什么想法或建议吗

(一)

错误:

Traceback (most recent call last):
  File "python", line 132, in <module>
  File "src/lxml/etree.pyx", line 3426, in lxml.etree.parse
  File "src/lxml/parser.pxi", line 1839, in lxml.etree._parseDocument
  File "src/lxml/parser.pxi", line 1865, in lxml.etree._parseDocumentFromURL
  File "src/lxml/parser.pxi", line 1769, in lxml.etree._parseDocFromFile
  File "src/lxml/parser.pxi", line 1162, in lxml.etree._BaseParser._parseDocFromFile
  File "src/lxml/parser.pxi", line 600, in lxml.etree._ParserContext._handleParseResultDoc
  File "src/lxml/parser.pxi", line 710, in lxml.etree._handleParseResult
  File "src/lxml/parser.pxi", line 637, in lxml.etree._raiseParseError
OSError: Error reading file 'bazorkversion--grify.repl.co': failed to load external entity "bazorkversion--grify.repl.co"
回溯(最近一次呼叫最后一次):
文件“python”,第132行,在
lxml.etree.parse中的文件“src/lxml/etree.pyx”,第3426行
文件“src/lxml/parser.pxi”,第1839行,在lxml.etree.\u parseDocument中
文件“src/lxml/parser.pxi”,第1865行,在lxml.etree中。\u parseDocumentFromURL
文件“src/lxml/parser.pxi”,第1769行,位于lxml.etree.\u parseDocFromFile
文件“src/lxml/parser.pxi”,第1162行,在lxml.etree.\u BaseParser.\u parseDocFromFile中
文件“src/lxml/parser.pxi”,第600行,在lxml.etree.\u ParserContext.\u handleParseResultDoc
lxml.etree中的文件“src/lxml/parser.pxi”,第710行
lxml.etree中的文件“src/lxml/parser.pxi”,第637行
OSError:读取文件“bazorkversion--grify.repl.co”时出错:未能加载外部实体“bazorkversion--grify.repl.co”
标题如下:

标题是字符串“PreAlpha 3” (它出现在浏览器选项卡顶部,站点favicon旁边)

,可能是lxml中的故障

相反,也许可以尝试使用另一个web抓取模块,如
BeautifulSoup
,以及
requests
模块来接收来自URL的请求:

>>> import requests
>>> from bs4 import BeautifulSoup as BS
>>> r = requests.get('https://bazorkversion--grify.repl.co/')
>>> soup = BS(r.text, 'lxml')
>>> soup.title.text
'PreAlpha 3'

你说的“头衔”是什么意思?URL的标题?有一个名为title的标签吗?也许提供一个有效的URL作为示例将我的错误添加到正文中
>>> import requests
>>> from bs4 import BeautifulSoup as BS
>>> r = requests.get('https://bazorkversion--grify.repl.co/')
>>> soup = BS(r.text, 'lxml')
>>> soup.title.text
'PreAlpha 3'