Python Selenium在旋转代理时抛出InvalidArgumentException

Python Selenium在旋转代理时抛出InvalidArgumentException,python,selenium,selenium-webdriver,Python,Selenium,Selenium Webdriver,所以我在GitHub上找到了这段代码,用于从中收集IP并旋转它们。但当我尝试运行它时,收到一条错误消息 我试图调试它,但找不到解决方案。我发现新版本的Chrome Web驱动程序出现了问题 代码如下: from selenium import webdriver from selenium.webdriver.chrome.options import DesiredCapabilities from selenium.webdriver.common.proxy import Proxy, P

所以我在GitHub上找到了这段代码,用于从中收集IP并旋转它们。但当我尝试运行它时,收到一条错误消息

我试图调试它,但找不到解决方案。我发现新版本的Chrome Web驱动程序出现了问题

代码如下:

from selenium import webdriver
from selenium.webdriver.chrome.options import DesiredCapabilities
from selenium.webdriver.common.proxy import Proxy, ProxyType

import time


co = webdriver.ChromeOptions()
co.add_argument("log-level=3")
co.add_argument("--headless")

def get_proxies(co=co):
    driver = webdriver.Chrome(chrome_options=co)
    driver.get("https://free-proxy-list.net/")

    PROXIES = []
    proxies = driver.find_elements_by_css_selector("tr[role='row']")
    for p in proxies:
        result = p.text.split(" ")

        if result[-1] == "yes":
            PROXIES.append(result[0]+":"+result[1])

    driver.close()
    return PROXIES


ALL_PROXIES = get_proxies()


def proxy_driver(PROXIES, co=co):
    prox = Proxy()

    if PROXIES:
        pxy = PROXIES[-1]
    else:
        print("--- Proxies used up (%s)" % len(PROXIES))
        PROXIES = get_proxies()

    prox.proxy_type = ProxyType.MANUAL
    prox.http_proxy = pxy
    prox.socks_proxy = pxy
    prox.ssl_proxy = pxy

    capabilities = webdriver.DesiredCapabilities.CHROME
    prox.add_to_capabilities(capabilities)

    driver = webdriver.Chrome(chrome_options=co, desired_capabilities=capabilities)

    return driver



# --- YOU ONLY NEED TO CARE FROM THIS LINE ---
# creating new driver to use proxy
pd = proxy_driver(ALL_PROXIES)

# code must be in a while loop with a try to keep trying with different proxies
running = True

while running:
    try:
        mycodehere()
        
        # if statement to terminate loop if code working properly
        something()
        
        # you 
    except:
        new = ALL_PROXIES.pop()
        
        # reassign driver if fail to switch proxy
        pd = proxy_driver(ALL_PROXIES)
        print("--- Switched proxy to: %s" % new)
        time.sleep(1)
这是我得到的错误:

Traceback (most recent call last):
  File "test_v1.py", line 53, in <module>
    pd = proxy_driver(ALL_PROXIES)
  File "test_v1.py", line 47, in proxy_driver
    driver = webdriver.Chrome('/home/djurovic/Desktop/Linux ChromeDriver/chromedriver', chrome_options=co, desired_capabilities=capabilities)
  File "/usr/local/lib/python3.6/dist-packages/selenium/webdriver/chrome/webdriver.py", line 81, in __init__
    desired_capabilities=desired_capabilities)
  File "/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py", line 157, in __init__
    self.start_session(capabilities, browser_profile)
  File "/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py", line 252, in start_session
    response = self.execute(Command.NEW_SESSION, parameters)
  File "/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py", line 321, in execute
    self.error_handler.check_response(response)
  File "/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/errorhandler.py", line 242, in check_response
    raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.InvalidArgumentException: Message: invalid argument: cannot parse capability: proxy
from invalid argument: Specifying 'socksProxy' requires an integer for 'socksVersion'
  (Driver info: chromedriver=2.45.615279 (12b89733300bd268cff3b78fc76cb8f3a7cc44e5),platform=Linux 4.15.0-43-generic x86_64)
回溯(最近一次呼叫最后一次):
文件“test_v1.py”,第53行,在
pd=代理驱动程序(所有代理)
代理驱动程序中第47行的文件“test_v1.py”
driver=webdriver.Chrome('/home/djurovic/Desktop/Linux ChromeDriver/ChromeDriver',Chrome\u options=co,所需的功能=功能)
文件“/usr/local/lib/python3.6/dist-packages/selenium/webdriver/chrome/webdriver.py”,第81行,在__
期望的_能力=期望的_能力)
文件“/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py”,第157行,在__
启动会话(功能、浏览器配置文件)
文件“/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py”,第252行,在启动会话中
响应=self.execute(Command.NEW_会话,参数)
文件“/usr/local/lib/python3.6/dist-packages/selenium/webdriver/remote/webdriver.py”,执行中第321行
self.error\u handler.check\u响应(响应)
文件“/usr/local/lib/python3.6/dist packages/selenium/webdriver/remote/errorhandler.py”,第242行,在check_响应中
引发异常类(消息、屏幕、堆栈跟踪)
selenium.common.exceptions.InvalidArgumentException:消息:无效参数:无法分析功能:代理
来自无效参数:指定“socksProxy”需要“socksVersion”的整数
(驱动程序信息:chromedriver=2.45.615279(12b89733300bd268cff3b78fc76cb8f3a7cc44e5),平台=Linux 4.15.0-43-generic x86_64)

我能够重现这个错误。。。错误在您使用的chrome驱动程序版本中,该版本为
2.45
。我认为这个版本有些不同


所以,你所要做的就是下载以前的chrome webdriver版本。我目前使用的是可以从下载的
2.41

从错误中,听起来您必须为socks的代理版本传入一个整数,这在您的代理列表中缺失。您可能需要将
pxy
列表中的内容与有效socks代理的示例进行比较。另请参见关于将selenium与socks@G.Anderson我早些时候发现了这个问题,但我无法将其付诸实施……你能抽出一分钟的时间,给我写一个简短的例子吗,它会比你想象的更有用…通过一个简单的谷歌搜索发现:它完全符合你想要做的,但使用Firefox是的,我对Chrome驱动程序有怀疑