Attached are two images containing two numbers, 81 and 82, which I am attempting to get Tesseract OCR to read.
Each time Tesseract OCR is returning empty page and producing an empty text.txt document.
The error is displaying as follows:
# tesseract 82.png out
Tesseract Open Source OCR Engine v4.1.1-rc2-20-g01fb with Leptonica
Warning: Invalid resolution 0 dpi. Using 70 instead.
Estimating resolution as 1622
Empty page!!
Estimating resolution as 1622
Empty page!!
How can I get the numbers to output? Are any changed required to the images or to tesseract?
These images have been produced using Centos 7, Apache, PHP and Imagick.
Retrieving the image from an external server, then processing the image using Imagick to crop, grayscale, trim to focus area, resize, smooth edges, remove background, set image to black and white, flatten the image, set a resolution and image format.
These images have then been saved (for development purposes) and tested using the above.
Once these errors are sorted and it's running, tesseract-ocr-php will complete the process on the fly (as there's around 6000 images to read).
Let me know.
Thank you (in advance).