Tesseract misinterprets letters in an invoice

59 views
Skip to first unread message

Alexandra

unread,
Jan 8, 2018, 9:58:56 AM1/8/18
to tesseract-ocr

Hello,


I am using Tesseract 4.0 and I am trying to OCR some invoices. My problem is that it gives wrong results for some letters, for example I will get a $ or an 8 when the letter is actually S.

The weird things is that some S's are guessed correctly, but some S's or not, and this applies to other letters as well.

My question is, how can I train Tesseract to handle these cases better?

Also, I was wonderinf if Tesseract misinterprets S in S.A. as being a number because of the dots.

I have attached the image that I am having problems with.


Thanks,

Alexandra

26693920_10208426535819481_1357486626_n.png
Reply all
Reply to author
Forward
0 new messages