Hi Carl,
> I haven't been able to figure out how to do either of those, but I get the
> feeling that is the wrong direction.
No, it sounds right, and you're nearly there. The relevant
documentation for you is the "CONFIG FILES AND AUGMENTING WITH USER
DATA" section of the manual[0].
So, call your word list eng.user-words, put it in the tessdata
directory, then create a config file called 'customwords' in the
tessdata/configs directory, with the following contents:
load_system_dawg F
load_freq_dawg F
user_words_suffix user-words
Note that when I say "the tessdata directory", I mean a directory
that by default will probably be /usr/share/tesseract-ocr/tessdata.
Hope that helps.
Nick
0.
http://tesseract-ocr.googlecode.com/svn-history/trunk/doc/tesseract.1.html#_config_files_and_augmenting_with_user_data