Hi, i was going to recognize a shop tickets, but a success recognition rate is about 80% which is low, right? i'm generating a new traineddata based on letters i cutted of a real tickets. i generate a pseudo text map for training purposes, but the very different letters come out indistinguishable for tesseract and i'm run out of ideas how to achieve at least 99.9% success level with practically no noised image, so please, help me found out me mistake here.
i attached examples:
training image
box file for training image
trained data
input image example (single line)
output for input image