Tesseract 4.0.0 fails to extract from some images in PSM 3

Russia Aiyappa

Feb 22, 2019, 10:49:09 AM

to tesseract-ocr

I ran Tesseract 4.0.0 on the attached image and found that PSM 3 (the default) fails to recognize any text and reports "Empty page". PSM 6 and PSM 7 do recognize the text, so I am trying to understand why PSM 3 fails. I understand that "By default Tesseract expects a page of text when it segments an image", but why does it fail to detect any text at all in an image like this one?

Any insight would be much appreciated.

Thank you.

image.tif
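
For anyone who wants to reproduce the comparison, here is a minimal sketch that runs the tesseract command line once per page segmentation mode and collects the output. It assumes the `tesseract` binary is on the PATH and that the attachment is saved locally as `image.tif`; the helper names `build_cmd` and `compare_psm` are made up for this example and are not part of Tesseract.

```python
import subprocess


def build_cmd(image_path, psm):
    """Build the tesseract CLI argument list for one page segmentation mode."""
    # Passing "stdout" as the output base makes tesseract print the
    # recognized text instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "--psm", str(psm)]


def compare_psm(image_path, modes=(3, 6, 7)):
    """Run tesseract once per mode; return {psm: (stdout, stderr)}."""
    results = {}
    for psm in modes:
        proc = subprocess.run(
            build_cmd(image_path, psm),
            capture_output=True,  # keep both the text and any diagnostics
            text=True,
        )
        results[psm] = (proc.stdout.strip(), proc.stderr.strip())
    return results


# Example usage (assumes tesseract is installed and image.tif exists):
# for psm, (text, diagnostics) in compare_psm("image.tif").items():
#     print(f"--psm {psm}: text={text!r} diagnostics={diagnostics!r}")
```

Capturing both streams keeps any diagnostic message next to the recognized text for each mode, which makes it easy to see which modes return nothing.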
