Tesseract 4.0.0 fails to extract from some images in PSM 3

33 views
Skip to first unread message

Russia Aiyappa

unread,
Feb 22, 2019, 10:49:09 AM2/22/19
to tesseract-ocr
I used the image attached to perform extraction and I noticed that PSM 3 (default) totally fails to recognize any text and prompts that it is an Empty page. Although PSM 6/7 does recognize this, I am trying to understand why PSM 3 failed. I do understand that "By default Tesseract expects a page of text when it segments an image" but why does it fail to even detect any text in the case of such an image.

Any insight would be much appreciated.

Thank you.
image.tif
Reply all
Reply to author
Forward
0 new messages