Single Character Recognition? How to change page segmentation mode? I'm a complete newbie.

105 views
Skip to first unread message

TMT888

unread,
Sep 30, 2017, 7:48:06 AM9/30/17
to tesseract-ocr
I am using capture2text which uses tesseract.

However I need it to recognize single characters. In my search I have found I need to change page segmentation mode to single characters (mode 10 I believe). I can't figure out where to go from here though. Everything I've read I don't understand, I barely know HTML and never learned any coding.

Dan9er

unread,
Sep 30, 2017, 11:41:01 AM9/30/17
to tesseract-ocr
tesseract input.jpg out --psm 10

David Sixela

unread,
Sep 30, 2017, 1:23:44 PM9/30/17
to tesseract-ocr
Are you willing to do that within the terminal (if yes see Dan9er's answer) or you're using a particular language ?
Message has been deleted

TMT888

unread,
Sep 30, 2017, 3:03:18 PM9/30/17
to tesseract-ocr
I am willing, I got to this:


I am not sure what to do from here. But I'll keep trying different things.

David Sixela

unread,
Sep 30, 2017, 3:32:05 PM9/30/17
to tesseract-ocr
I don't know much about Capture2text, but according to the picture you just sent, make sure to give the right filename and the right path to it.
Is the picture you're trying to read called "input.jpg" ? If yes, is it saved in "C:\Capture2Text_v3.8\Utils\Tesseract" ?

After googling "Capture2Text", you should try to to check the settings according to this http://capture2text.sourceforge.net/#specify_ocr_language , you might be able to set the page segmentation mode somewhere in there.

Hope this can help.

TMT888

unread,
Sep 30, 2017, 9:15:28 PM9/30/17
to tesseract-ocr
Thanks very much for the help, I think I got it finally :)

Dan9er

unread,
Oct 3, 2017, 9:31:45 AM10/3/17
to tesseract-ocr
Ew, Sourceforge!
If I were you I would IMMEDIATELY install Malwarebytes and run a scan because EVERYTHING from freeware sites is stuffed with malware. NEVER GET ANYTHING FROM SOURCEFORGE.

David Sixela

unread,
Oct 3, 2017, 9:36:45 AM10/3/17
to tesseract-ocr
LOL Dan9er, i didn't download anything from there, i was just showing the document
Reply all
Reply to author
Forward
0 new messages