Advice for working with low-resolution images

48 views

Skip to first unread message

Bhargav Kowshik

unread,

Nov 6, 2022, 10:37:29 AM11/6/22

to tesseract-ocr

In the last few days it has been a joyful experience getting to know Tesseract and using it in one of my projects. Thank you all.

The challenge I am facing is that the OCR detection is not correct on some of the PDF files I am working with and I am writing to this group seeking advice on how I could do this better. The PDF file has identifiers like below which when OCR'ed with Tesseract gives the result "WaJES58865" while the right answer is "UZJ6358865".

While checking on an online tool in https://www.imagetotext.info/ the OCR text is correct. So, happy yo hear to any tips and tricks to get the right detections using Tesseract too.

Reply all

Reply to author

Forward

0 new messages