Re: How to output multiple matches/results instead of just one?


Tom Morris

May 6, 2016, 11:17:43 AM
to tesseract-ocr
You'll need to use the API to get that level of control.
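For readers who want a starting point, here is a minimal sketch of what "using the API" could look like from Python. It assumes the third-party `tesserocr` binding and the `jpn` language data are installed; the thread itself does not name a binding, and `rank_choices` and `symbol_choices` are hypothetical helper names. The engine decides how many alternatives it reports, so a full list of ten is not guaranteed.

```python
from typing import Iterable, List, Tuple

Candidate = Tuple[str, float]  # (character, confidence as reported by the engine)


def rank_choices(candidates: Iterable[Candidate], limit: int = 10) -> List[Candidate]:
    """Keep the best confidence per character, sorted from high to low."""
    best = {}
    for text, conf in candidates:
        # Skip empty strings and keep only the highest score for each character.
        if text and conf > best.get(text, float("-inf")):
            best[text] = conf
    ranked = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return ranked[:limit]


def symbol_choices(image_path: str, lang: str = "jpn", limit: int = 10) -> List[Candidate]:
    """Return alternative readings for the first symbol found in the image."""
    # Assumption: tesserocr is available. It is imported lazily so that
    # rank_choices stays usable on machines without Tesseract installed.
    from tesserocr import PSM, RIL, PyTessBaseAPI, iterate_level

    # PSM.SINGLE_CHAR corresponds to "-psm 10" on the command line.
    with PyTessBaseAPI(lang=lang, psm=PSM.SINGLE_CHAR) as api:
        api.SetImageFile(image_path)
        # Ask the classifier to keep its per-symbol alternatives.
        api.SetVariable("save_blob_choices", "T")
        api.Recognize()
        for symbol in iterate_level(api.GetIterator(), RIL.SYMBOL):
            choices = [
                (choice.GetUTF8Text(), choice.Confidence())
                for choice in symbol.GetChoiceIterator()
            ]
            return rank_choices(choices, limit)
    return []
```

Calling `symbol_choices("tree.gif")` would then return a list of (character, confidence) pairs for the image from the question, provided the local Tesseract build can read that image format.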

On Thursday, May 5, 2016 at 8:22:54 AM UTC-4, shutupyoudo...@gmail.com wrote:

I want to use Tesseract to recognize images of a single Chinese/Japanese character, handwritten by a client-side user on an HTML5 canvas.


After several tests, I find Tesseract's recognition of handwritten Chinese characters to be very good, but sometimes the result is slightly off. Sometimes I get the correct 木 character as output, but sometimes I get the slightly different 本 character.


How can I make Tesseract output something like a list of top 10 best matches, one of which will likely be the desired result, instead of just outputting one incorrect result?


I want it to do something like:


Command: tesseract tree.gif out -psm 10 -l jpn  ---->  Output (best matches): 本, 木, 休, 十, 八, 六, ... etc.


An example image I'm using is below.


Quan Nguyen

May 6, 2016, 11:56:22 AM
to tesseract-ocr

shutupyoudo...@gmail.com

May 7, 2016, 10:24:20 AM
to tesseract-ocr
Thanks, Tom!

shutupyoudo...@gmail.com

May 7, 2016, 10:25:43 AM
to tesseract-ocr
Thanks, Quan!  I appreciate you elaborating and including a link.