Python: Incomplete HTML response on some sites using Requests & Selenium - Fatal编程技术网


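I want to use Requests and BeautifulSoup in Python to scrape information from a number of URLs. Some sites, however, return only a partial HTML response that does not contain the page content.

This is the code that does not work: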

import requests
from bs4 import BeautifulSoup
url = "http://www.exampleurl.com"
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')

Here is the incomplete response:

I also tried Selenium with the Chrome WebDriver, but ran into the same problem:

from selenium import webdriver

# Run Chrome headless, in incognito mode, ignoring certificate errors
options = webdriver.ChromeOptions()
options.add_argument('--ignore-certificate-errors')
options.add_argument('--incognito')
options.add_argument('--headless')
browser = webdriver.Chrome(options=options)
browser.get(url)  # same url as in the requests snippet
html = browser.page_source

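Any ideas?

What happens

You do not get the expected HTML because it is located inside an iframe.

Try reading the iframe's src with soup.find('iframe')['src'] and then request that URL.
Example: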

import requests
from bs4 import BeautifulSoup

url = "http://www.ingenieur-jobs.de/jobangebote/3075/"
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')
iframe = requests.get(soup.find('iframe')['src'])
soup = BeautifulSoup(iframe.content, 'html.parser')
soup
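The example above assumes the page has exactly one iframe and that its src is an absolute URL. A slightly more defensive variant might look like the sketch below. The helper names `iframe_urls` and `fetch_with_iframes` and the 10-second timeout are assumptions made for illustration, not part of the original answer. Relative src values are resolved against the page URL with `urllib.parse.urljoin`, and iframes without a src are skipped.

```python
from urllib.parse import urljoin

import requests
from bs4 import BeautifulSoup


def iframe_urls(html, base_url):
    """Return the absolute URL of every <iframe src=...> found in html.

    Hypothetical helper: the name and signature are illustrative only.
    """
    soup = BeautifulSoup(html, "html.parser")
    urls = []
    for frame in soup.find_all("iframe"):
        src = frame.get("src")
        if src:  # skip iframes that have no src attribute
            urls.append(urljoin(base_url, src))  # resolves relative src values
    return urls


def fetch_with_iframes(url, timeout=10):
    """Fetch a page and the documents embedded in its iframes.

    Returns a list of BeautifulSoup objects: the outer page first,
    followed by one entry per iframe that could be requested.
    """
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    pages = [BeautifulSoup(response.content, "html.parser")]
    for frame_url in iframe_urls(response.content, response.url):
        frame_response = requests.get(frame_url, timeout=timeout)
        frame_response.raise_for_status()
        pages.append(BeautifulSoup(frame_response.content, "html.parser"))
    return pages

# Usage (performs real HTTP requests):
# pages = fetch_with_iframes("http://www.ingenieur-jobs.de/jobangebote/3075/")
```

Splitting the URL extraction from the fetching keeps the parsing step easy to check on a saved HTML string without touching the network. If an iframe is injected by JavaScript after the page loads, it will not appear in the HTML that Requests receives, so this approach would not find it.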
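The iframe explanation plausibly covers the Selenium attempt from the question as well: `page_source` describes the document the driver is currently focused on, which is the top-level page unless you switch into a frame. The sketch below switches into each iframe and collects its source. The function names `make_headless_chrome` and `page_sources_including_iframes` are hypothetical and exist only for this illustration.

```python
# Locator strategy string; it has the same value as
# selenium.webdriver.common.by.By.TAG_NAME.
TAG_NAME = "tag name"


def make_headless_chrome():
    """Create a headless Chrome driver (hypothetical convenience helper)."""
    # Imported here so the helper below can be tried without a browser setup.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    return webdriver.Chrome(options=options)


def page_sources_including_iframes(driver, url):
    """Return the top-level page source followed by each iframe's source."""
    driver.get(url)
    sources = [driver.page_source]  # top-level document only
    for frame in driver.find_elements(TAG_NAME, "iframe"):
        driver.switch_to.frame(frame)        # focus the embedded document
        sources.append(driver.page_source)   # now refers to the iframe
        driver.switch_to.default_content()   # return to the top-level page
    return sources

# Usage (starts a real browser):
# browser = make_headless_chrome()
# try:
#     sources = page_sources_including_iframes(browser, url)
# finally:
#     browser.quit()
```

This handles one level of nesting only. If the iframe is added or filled in by JavaScript some time after the initial load, an explicit wait before reading `page_source` would likely be needed as well.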
