Please forgive the newbie question. I've seen this posted several
times before, and I thought I had the right solution but apparently
not. Attached is a PNG that I'd like to run through tesseract. I
used ImageMagick's convert to change it into a tiff:
convert -density 200 -units PixelsPerInch test_page.png -type
Grayscale +compress test_input.tif
(I've also tried to do this at -density 300 with the same results)
The resulting TIF is attached. When I run it through tesseract I get
an output file that is one byte and is basically blank. Command and
output below.
tesseract test_input.tif output -l eng
Tesseract Open Source OCR Engine
Image has 8 * 1 bits per pixel, and size (375,350)
Resolution=200
I saw some other threads about a similar problem, but the solutions
were to scale it to 200 or 300 DPI, make sure it was in grayscale,
remove the alpha layer, and somewhere else it said it was fixed in
Tesseract 2.04. I'm using Tesseract 2.04 on Mac OS X 10.6.6 and
ImageMagick 6.6.7-1. Is my image just unsuitable for OCR-ing?
I appreciate any help.
Thanks,
Bob
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com.
To unsubscribe from this group, send email to tesseract-oc...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.