I tried using bazaar with my user-words and results are way much better, also working on image pre-processing contributed to improve output.
I have another issue now: I expanded my list of user-words to about 7000 words but i get this error:
>>Error: word '......' not in DAWG after adding it
>>Error: failed to load /usr/local/share/tessdata/ita.user-words
I found a report of the problem here:
https://code.google.com/p/tesseract-ocr/issues/detail?id=1020but still I don't know how to solve it. Reading through the source code (in dict.h) I found, like in the report:
static const int kMaxUserDawgEdges = 50000;
is this that cause the error? But my list is of 7000 words, which is much less than 50000...
I don't understand.
Thank you very much.
Elena