Python cannot extract text from an image
I have been working on a project that uses tesseract to extract text from images, and I am using Python 3.7.7, but I have run into an error that I cannot resolve.
tess.pytesseract.tesseract_cmd = r'C:\\Program Files (x86)\\Tesseract-OCR\\tess1\\eng.traineddata'
img = Image.open('C:\\Users\\USER\\PycharmProjects\\selenium\\automation\\screenshot.png')
text = tess.image_to_string(img, lang='eng')
When I run this program, I get the following error:
Traceback (most recent call last):
File "C:/Users/USER/PycharmProjects/selenium/automation/open.py", line 8, in <module>
text = tess.image_to_string(img, lang='eng')
File "C:\Users\USER\PycharmProjects\selenium\venv\lib\site-packages\pytesseract\pytesseract.py", line 360, in image_to_string
}[output_type]()
File "C:\Users\USER\PycharmProjects\selenium\venv\lib\site-packages\pytesseract\pytesseract.py", line 359, in <lambda>
Output.STRING: lambda: run_and_get_output(*args),
File "C:\Users\USER\PycharmProjects\selenium\venv\lib\site-packages\pytesseract\pytesseract.py", line 270, in run_and_get_output
run_tesseract(**kwargs)
File "C:\Users\USER\PycharmProjects\selenium\venv\lib\site-packages\pytesseract\pytesseract.py", line 241, in run_tesseract
raise e
File "C:\Users\USER\PycharmProjects\selenium\venv\lib\site-packages\pytesseract\pytesseract.py", line 238, in run_tesseract
proc = subprocess.Popen(cmd_args, **subprocess_args())
File "C:\Python37\lib\subprocess.py", line 800, in __init__
restore_signals, start_new_session)
File "C:\Python37\lib\subprocess.py", line 1207, in _execute_child
startupinfo)
OSError: [WinError 193] %1 is not a valid Win32 application
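Please suggest a suitable solution.
You need to install the Tesseract software on your system before you can use pytesseract. pytesseract is only a library that internally calls the Tesseract OCR engine.
Tesseract installation
Windows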
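Once Tesseract is installed, `tesseract_cmd` has to name the Tesseract executable itself. In the question it points at `eng.traineddata`, a language data file, and asking Windows to launch a non-executable file is what produces `WinError 193`. The following is a minimal sketch of one way to guard against that, not a definitive fix. The helper names (`looks_like_tesseract_binary`, `find_tesseract`, `extract_text`) and the `CANDIDATE_PATHS` list are hypothetical, and the install locations are only common defaults that may not match your machine.

```python
import shutil
from pathlib import Path

# Assumed install locations (hypothetical); adjust to where Tesseract actually lives.
CANDIDATE_PATHS = [
    r"C:\Program Files\Tesseract-OCR\tesseract.exe",
    r"C:\Program Files (x86)\Tesseract-OCR\tesseract.exe",
]


def looks_like_tesseract_binary(path):
    """Return True if path is an existing file named tesseract or tesseract.exe."""
    p = Path(path)
    return (
        p.is_file()
        and p.stem.lower() == "tesseract"
        and p.suffix.lower() in ("", ".exe")
    )


def find_tesseract(candidates=CANDIDATE_PATHS):
    """Return the first usable Tesseract executable, or None if none is found."""
    # Prefer whatever is already on PATH.
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    for candidate in candidates:
        if looks_like_tesseract_binary(candidate):
            return candidate
    return None


def extract_text(image_path, lang="eng"):
    """Run OCR on image_path after pointing pytesseract at the executable."""
    # Imported lazily so the helpers above work without these packages installed.
    import pytesseract
    from PIL import Image

    cmd = find_tesseract()
    if cmd is None:
        raise FileNotFoundError(
            "Tesseract executable not found; install Tesseract first"
        )
    pytesseract.pytesseract.tesseract_cmd = cmd
    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img, lang=lang)
```

With this in place, `extract_text(r'C:\Users\USER\PycharmProjects\selenium\automation\screenshot.png')` would replace the three lines from the question. The language files such as `eng.traineddata` are selected through the `lang` argument, not through `tesseract_cmd`.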