In the last few days it has been a joyful experience getting to know Tesseract and using it in one of my projects. Thank you all.
The challenge I am facing is that the OCR detection is not correct on some of the PDF files I am working with and I am writing to this group seeking advice on how I could do this better. The PDF file has identifiers like below which when OCR'ed with Tesseract gives the result "WaJES58865" while the right answer is "UZJ6358865".
While checking on an online tool in
https://www.imagetotext.info/ the OCR text is correct. So, happy yo hear to any tips and tricks to get the right detections using Tesseract too.