On Fri, Jul 04, 2014 at 02:08:46AM -0700, Meenal Goyal wrote:
> If you're sure that all the words you will encounter will be in the
> dictionary this should help somewhat:
>
> https://code.google.com/p/tesseract-ocr/wiki/FAQ#How_to_increase_the_trust_in/strength_of_the_dictionary?
>
> The words won't always be in the dictionary, so I tried adding them to the file
> eng.user-words, but I'm confused about the weight given to this file relative to
> the already defined dictionaries.
> Also, I read that post earlier about strengthening the dictionary and
> tried modifying some variables in the configuration file. But then it starts
> recognizing wrong words; maybe it's a case of over-correcting.
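
For reference, the kind of configuration change being discussed might look
roughly like the sketch below. It assumes the two variables in question are
language_model_penalty_non_dict_word and
language_model_penalty_non_freq_dict_word, and that the config file can be
passed by path as the last command-line argument. The penalty values, the
file names and the helper functions are made up for illustration, not
recommended settings.

```python
# Sketch only: variable names are assumed to be the ones the FAQ entry refers to,
# and the penalty values are illustrative guesses, not tuned recommendations.

def build_config(non_dict_penalty, non_freq_dict_penalty):
    """Return config-file text in Tesseract's "name value" per-line format."""
    return (
        f"language_model_penalty_non_dict_word {non_dict_penalty}\n"
        f"language_model_penalty_non_freq_dict_word {non_freq_dict_penalty}\n"
    )


def build_command(image, output_base, config_path, lang="eng"):
    """Return the argument list for a tesseract run that loads the config file."""
    # Assumption: the config file is given by path as the final argument.
    return ["tesseract", image, output_base, "-l", lang, config_path]


if __name__ == "__main__":
    # Hypothetical file names; save the printed config text as strong_dict.cfg.
    print(build_config(0.5, 0.3), end="")
    print(" ".join(build_command("page.tif", "page", "strong_dict.cfg")))
```

Raising both penalties makes non-dictionary words less likely to be chosen,
which is the same mechanism that leads to the over-correction described above.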
Yes, that's the problem with just emphasising the dictionary.
be very hard to stop it producing garbage output. So I'm afraid