How to get multiple results (i.e. alternative words), each with its own confidence?


Eli Marmor

Jul 12, 2019, 5:21:24 AM
to tesseract-ocr
I'm a newbie (and this is also my first post to this group), so please excuse me if this is a silly question...
Tesseract gives me the best-matching word for each word in the image.
I can also get its confidence if I choose hOCR as the output format.
Unfortunately, I have found no way to tell Tesseract to give me alternative results with lower confidences.
I'd be glad to learn how to do this, either from the command line or through the C API.

Thanks in advance,
Eli Marmor

Zdenko Podobny

Jul 12, 2019, 5:23:34 AM
to tesser...@googlegroups.com

On Fri, Jul 12, 2019 at 11:21, Eli Marmor <e...@netmask.it> wrote:
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/e8cfdd82-6629-4738-bf83-a4fb290e309a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Eli Marmor

Jul 12, 2019, 5:26:54 AM
to tesseract-ocr
Thank you very much!
Funny, I read that page, but didn't notice this specific example...
