Adding a custom dictionary


nathan

Nov 10, 2009, 5:54:57 PM
to tesseract-ocr
I've heard you can define a custom dictionary for Tesseract... but I
haven't been able to find any documentation on doing so.

Does anyone have any experience with this?

Thanks

Ray Smith

Nov 11, 2009, 12:28:45 AM
to tesser...@googlegroups.com
Put the wordlist in <lang>.user-words or recreate <lang>.word-dawg using wordlist2dawg.
Ray.
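
A minimal sketch of the first option, assuming the user-words file is plain text with one word per line. The helper name write_user_words, the file name eng.user-words (with "eng" standing in for <lang>), and the sample words are hypothetical and only for illustration; they are not part of Tesseract.

```python
from pathlib import Path


def write_user_words(words, path):
    """Write a word list as plain text, one word per line.

    Blank entries are dropped and duplicates removed, keeping first-seen order.
    Returns the list of words actually written.
    """
    seen = set()
    cleaned = []
    for word in words:
        word = word.strip()
        if word and word not in seen:
            seen.add(word)
            cleaned.append(word)
    # One word per line, UTF-8 encoded, with a trailing newline after each word.
    Path(path).write_text("".join(w + "\n" for w in cleaned), encoding="utf-8")
    return cleaned


if __name__ == "__main__":
    # Hypothetical example: "eng" stands in for <lang>; the words are placeholders.
    written = write_user_words(
        ["tesseract", "wordlist", "tesseract", " dawg "],
        "eng.user-words",
    )
    print(f"wrote {len(written)} words")
```

Where the resulting file has to live is not stated in this thread; placing it next to the other <lang> data files is an assumption to check against your Tesseract version.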

noha radwan

Jun 2, 2015, 8:26:14 AM
to tesser...@googlegroups.com
Hello,

I tried following the approach from this post: stackoverflow.com/questions/9568165/custom-dictionary-for-tesseract
However, it doesn't seem to make any difference.

Please correct me if I am wrong, but this is how I understand it: when following that approach, I basically erase the eng.traineddata file and replace it with my own custom words file. Does this mean that Tesseract can then only recognize words from the custom list? I ask because following this method did not change Tesseract's detection at all.

I appreciate any help I can get.

Thanks,
Noha