Advice for working with low-resolution images

Bhargav Kowshik

Nov 6, 2022, 10:37:29 AM
to tesseract-ocr
In the last few days it has been a joyful experience getting to know Tesseract and using it in one of my projects. Thank you all.

The challenge I am facing is that Tesseract's OCR output is incorrect for some of the PDF files I am working with, so I am writing to this group for advice on how to improve it. The PDF files contain identifiers like the one below. When OCR'ed with Tesseract, it comes out as "WaJES58865", while the right answer is "UZJ6358865".
after.png
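
To make the mismatch concrete, here is a minimal sketch in plain Python (no Tesseract involved) that compares the two strings quoted above character by character. The function name char_mismatches is only an illustrative helper, not part of any library, and it assumes both strings have the same length.

```python
def char_mismatches(ocr_text, expected):
    """Return (index, ocr_char, expected_char) for each position that differs."""
    if len(ocr_text) != len(expected):
        # Assumption: a position-by-position comparison only makes sense
        # when both strings have the same number of characters.
        raise ValueError("this simple comparison assumes equal-length strings")
    return [
        (i, got, want)
        for i, (got, want) in enumerate(zip(ocr_text, expected))
        if got != want
    ]


ocr_text = "WaJES58865"   # what Tesseract returned
expected = "UZJ6358865"   # the correct identifier

mismatches = char_mismatches(ocr_text, expected)
for index, got, want in mismatches:
    print(f"position {index}: got {got!r}, expected {want!r}")
```

Four of the ten characters differ, all within the first five positions; the trailing digits "58865" are read correctly.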

When I try the online tool at https://www.imagetotext.info/, the OCR text is correct. So I would be happy to hear any tips and tricks for getting the right result with Tesseract too.