Tesseract misinterprets letters in an invoice

59 views

Skip to first unread message

Alexandra

unread,

Jan 8, 2018, 9:58:56 AM1/8/18

to tesseract-ocr

Hello,

I am using Tesseract 4.0 and I am trying to OCR some invoices. My problem is that it gives wrong results for some letters, for example I will get a $ or an 8 when the letter is actually S.

The weird things is that some S's are guessed correctly, but some S's or not, and this applies to other letters as well.

My question is, how can I train Tesseract to handle these cases better?

Also, I was wonderinf if Tesseract misinterprets S in S.A. as being a number because of the dots.

I have attached the image that I am having problems with.

Thanks,

Alexandra

26693920_10208426535819481_1357486626_n.png

Reply all

Reply to author

Forward

0 new messages