I trained Tesseract (based on the eng language) to work with a particular customer derived font.
hen I have finished training, I use other image files and try and scan for characters, during this I get errors in reading a 0 (zero). It will come up sometimes as an 8, Q, D or an O (the letter).
The images I am using are not mixing up font size nor putting in lower case.
I have tried using the unicharambigs file, but I'm not sure that it is being implemented correctly. I renamed it to eng.unicharambigs as well and nothing happened there. I even tried to set the Type Indicator value to 1 to mandate the substitution and nothing happened.
Anyway, I feel I'm missing something simple, but have wound myself around this problem where I put myself in the middle of the forest.
If someone can point me in a proper direction or give a few pointers I would appreciate the help.
The files created or used:
Thanks,
Jim