Hello,
I am using Tesseract 4.0 and I am trying to OCR some invoices. My problem is that it gives wrong results for some letters, for example I will get a $ or an 8 when the letter is actually S.
The weird things is that some S's are guessed correctly, but some S's or not, and this applies to other letters as well.
My question is, how can I train Tesseract to handle these cases better?
Also, I was wonderinf if Tesseract misinterprets S in S.A. as being a number because of the dots.
I have attached the image that I am having problems with.
Thanks,
Alexandra