Simple text bad results

67 views
Skip to first unread message

Tyler

unread,
Jun 22, 2023, 1:13:01 PM6/22/23
to tesseract-ocr
Hi,

I'm having issues getting good accuracy for some text. I have read through https://github.com/tesseract-ocr/tessdoc/blob/main/ImproveQuality.md as well as tried many different image preprocessing such as resizing, dilation/erosion, thresholding, and applying blurs. I have also used appropriate whitelisted characters but still cannot get great accuracy.

See attached some example images I am having trouble with. I am using Tesseract 4.1.1, and pytesseract 0.3.10.

I'd appreciate any pointers or ideas to get better accuracy. At this point, I am thinking I may have to fine tune the model.
short.png
mid.png
long.png
Reply all
Reply to author
Forward
0 new messages