Frequently recognizes '5' as '9'

34 views
Skip to first unread message

Keith Gorlen

unread,
Jun 1, 2024, 1:47:06 AMJun 1
to tesseract-ocr
I just started using ocrmypdf 16.3.0 with Tesseract 5.3.4.20240503 to extract text from Pacific Gas & Electric PDF bills downloaded from a PG&E account website.  It works OK, but the most frequent error occurring is that '5' is recognized as '9'.  Any tips for improvement?
2911custbill03152024-OCR.pdf
Reply all
Reply to author
Forward
0 new messages