How to improve the recognition of receipt (text not in words dictionary)

251 views
Skip to first unread message

Laura

unread,
Jun 20, 2017, 2:35:25 AM6/20/17
to tesseract-ocr

Hi, I’m new on tesseract. I’m trying to recognize receipts. Since on receipts, lots of text are not dictionary words. I disabled the dictionaries,  it increased the recognition rate, but it’s still low, I’d like to create my own dictionary with the product catalog.

Is there someone who can give the tutorial to do it ?

Many thanks !

Laura

ShreeDevi Kumar

unread,
Jun 20, 2017, 3:24:22 AM6/20/17
to tesser...@googlegroups.com

on stable 3.0x you can try by adding your product catalog to eng.user-words file and check for improvement.

In my unit test, it seemed to apply the words from user dict.

Alternately, you can also try withthe development version tesseract 4, --oem 1 directly - I don't think user-words work with it, but it might give you better recognition.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ee61c476-8aee-4d58-a3a7-2bbf5d292eb8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

sfo

unread,
Jun 29, 2017, 10:52:07 AM6/29/17
to tesseract-ocr
hello Laura! could you please tell me how did you
disable the dictionaries?

srn...@gmail.com

unread,
Jul 13, 2017, 8:11:38 AM7/13/17
to tesseract-ocr
Hello laura, can you please tell me, have you have achieved this or not. Iam alos trying to do same thing , and if yes, can you please give any advise.
Reply all
Reply to author
Forward
0 new messages