Php 改进tesseract OCR数字识别_Php_Ocr_Tesseract

Php 改进tesseract OCR数字识别

php

Php 改进tesseract OCR数字识别,php,ocr,tesseract,Php,Ocr,Tesseract,我一直在使用tesseract和不同的psm选项，我尝试使用以下模式：当我处理这个的时候，我得到了52658，它把5和8调高，并丢失了小数点。我是否可以更准确地阅读以下内容：图像最初是透明的，我用PHP添加了白色背景，试图给它更好的识别效果，但没有效果。图像太小了我使用ImageMagick调整了它的大小，它开始正确地进行OCR，使用Tesseract 3.02和3.03： $ tesseract 8UAYy.png ooo Tesseract Open Source OCR Engin

我一直在使用tesseract和不同的psm选项，我尝试使用以下模式：

当我处理这个的时候，我得到了52658，它把5和8调高，并丢失了小数点。我是否可以更准确地阅读以下内容：

图像最初是透明的，我用PHP添加了白色背景，试图给它更好的识别效果，但没有效果。

图像太小了

我使用ImageMagick调整了它的大小，它开始正确地进行OCR，使用Tesseract 3.02和3.03：

$ tesseract 8UAYy.png ooo
Tesseract Open Source OCR Engine v3.03 with Leptonica
$ cat ooo.txt 
B2 655

$ convert 8UAYy.png -resize 300% ooo.png
$ tesseract ooo.png ooo
Tesseract Open Source OCR Engine v3.03 with Leptonica
$ cat ooo.txt 
82.685

$ tesseract302 ooo.png ooo
Tesseract Open Source OCR Engine v3.02.02 with Leptonica
$ cat ooo.txt 
82.685

图像太小了

我使用ImageMagick调整了它的大小，它开始正确地进行OCR，使用Tesseract 3.02和3.03：

$ tesseract 8UAYy.png ooo
Tesseract Open Source OCR Engine v3.03 with Leptonica
$ cat ooo.txt 
B2 655

$ convert 8UAYy.png -resize 300% ooo.png
$ tesseract ooo.png ooo
Tesseract Open Source OCR Engine v3.03 with Leptonica
$ cat ooo.txt 
82.685

$ tesseract302 ooo.png ooo
Tesseract Open Source OCR Engine v3.02.02 with Leptonica
$ cat ooo.txt 
82.685

你可以试着对图像和它进行预处理。你可以试着对图像和它进行预处理。是的，我最终也通过使用：

$image->setResolution（150150）后接$image->重采样图像（175175，imagick:：FILTER_UNDEFINED，1）在调整大小后，我还必须使用大量锐化来使其工作。是的，我最终也使用了：$image->setResolution（150150）后接$image->重采样图像（175175，imagick:：FILTER_UNDEFINED，1）在调整大小后，我还必须使用大量锐化来让它工作