OCR can't recognize some old Punjabi (Gurmukhi) characters

34 views
Skip to first unread message

Zaki Rangwala

unread,
Jul 8, 2021, 11:10:20 AM7/8/21
to tesseract-ocr
I am trying to analyze the GGS, and some characters are not recognized by the OCR due to missing fonts. Is there already a source that has selectable GGS text and is it my computer that is missing the fonts? Are they any pre-trained models that can recognize this? 

For example, I enclosed an image in this image and something like this does not get properly recognized properly by the OCR. What can I do to improve my results?
Screen Shot 2021-07-08 at 11.00.12 AM.png
Reply all
Reply to author
Forward
0 new messages