Python: unexpected error when using BeautifulSoup and Selenium
I am working on a project that needs to extract data from a web page. To get started, I used the following code to build my system. Note: I am using the PyCharm IDE and Chrome.
from bs4 import BeautifulSoup
from selenium import webdriver
driver = webdriver.Chrome()
driver.get('https://www.snapdeal.com/product/skycandle-bfm-single-bucket-mop/684185386697#bcrumbSearch:skycandle')
html = driver.page_source
soup = BeautifulSoup(html)
print(soup.prettify())
A Chrome instance opens the page given in the code, but the expected output of .prettify() never appears. Instead, the script fails at html = driver.page_source with:
selenium.common.exceptions.NoSuchWindowException: Message: no such window: target window already closed
The full traceback is reproduced at the end of this post.
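Please help me correct this.
Comment: Your script works fine, so the problem lies elsewhere. Could you try again after adding driver.close() or driver.quit() once the printing is done?
Reply: I tried, but the problem still persists.
Comment: There does seem to be an issue. Try using the Firefox driver or PhantomJS instead of Chrome().
Reply: Will try. Thanks.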
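The driver.quit() suggestion from the comments can be sketched as follows. This is only an illustration, not a confirmed fix for the NoSuchWindowException: the asker reports that the error persists even with quit() added. The helper names managed_driver, fetch_html and main are hypothetical and do not come from the original post. The sketch makes sure the browser session is always shut down, even when page_source raises, and passes an explicit parser to BeautifulSoup. Shutting the session down explicitly may also avoid the secondary "Exception ignored in Service.__del__" message at interpreter exit, though that is an assumption.

```python
from contextlib import contextmanager


@contextmanager
def managed_driver(factory):
    """Create a driver with factory() and always call quit() on exit."""
    driver = factory()
    try:
        yield driver
    finally:
        # quit() ends the whole browser session; close() only closes the current window.
        driver.quit()


def fetch_html(factory, url):
    """Load url and return the page source; the browser is shut down either way."""
    with managed_driver(factory) as driver:
        driver.get(url)
        return driver.page_source


def main():
    # Imported here so the helpers above can be exercised without a browser installed.
    from bs4 import BeautifulSoup
    from selenium import webdriver

    url = ("https://www.snapdeal.com/product/skycandle-bfm-single-bucket-mop/"
           "684185386697#bcrumbSearch:skycandle")
    # webdriver.Firefox can be passed instead, as the second comment suggests.
    html = fetch_html(webdriver.Chrome, url)
    # Naming the parser avoids BeautifulSoup's "no parser was explicitly specified" warning.
    soup = BeautifulSoup(html, "html.parser")
    print(soup.prettify())
```

Calling main() runs the original scrape. Passing the driver class as a factory keeps the cleanup logic independent of which browser is used, so switching from Chrome to Firefox is a one-word change.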
C:\Users\kp\AppData\Local\Programs\Python\Python36-32\python.exe C:/Users/kp/PycharmProjects/scrap_selenium/prog.py
Traceback (most recent call last):
File "C:/Users/kp/PycharmProjects/scrap_selenium/prog.py", line 28, in <module>
html = driver.page_source
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 500, in page_source
return self.execute(Command.GET_PAGE_SOURCE)['value']
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 237, in execute
self.error_handler.check_response(response)
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\remote\errorhandler.py", line 194, in check_response
raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.NoSuchWindowException: Message: no such window: target window already closed
from unknown error: web view not found
(Session info: chrome=56.0.2924.87)
(Driver info: chromedriver=2.27.440174 (e97a722caafc2d3a8b807ee115bfb307f7d2cfd9),platform=Windows NT 10.0.14393 x86_64)
Exception ignored in: <bound method Service.__del__ of <selenium.webdriver.chrome.service.Service object at 0x0353C810>>
Traceback (most recent call last):
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\common\service.py", line 162, in __del__
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\common\service.py", line 138, in stop
File "C:\Users\kp\AppData\Local\Programs\Python\Python36-32\lib\site-packages\selenium\webdriver\common\service.py", line 109, in send_remote_shutdown_command
ImportError: sys.meta_path is None, Python is likely shutting down
Process finished with exit code 1