How to do OCR for 96dpi screenshots from computer display with 100% accuracy?

Vadim Melnik

Oct 19, 2023, 1:07:30 AM
to tesseract-ocr
Hello,

We are processing PNG screenshots captured from a computer display at 96 dpi. These are purely black-and-white images rendered with a single known TrueType font at a fixed size, without antialiasing or any other subpixel rendering. The picture is clean, opaque and pixelated, as shown below, and each character (glyph) always has the same 2D pixel structure:

screen1.png
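
To make the "same 2D structure" property concrete, here is a minimal sketch of what it would allow in principle: since every glyph is pixel-identical each time it is drawn, a glyph bitmap can serve as an exact dictionary key. This is plain bitmap lookup, not Tesseract and not something we have tried. The 3x3 glyph table, the fixed cell size, and the names `GLYPHS`, `render` and `recognize` are all hypothetical; a real table would be rendered from the actual font, and a proportional font would need a segmentation step that is not shown here.

```python
# Hypothetical 3x3 glyph bitmaps ("1" = ink, "0" = background).
# A real table would be generated by rendering the actual font.
GLYPHS = {
    "I": ("010", "010", "010"),
    "L": ("100", "100", "111"),
    "T": ("111", "010", "010"),
}
CELL_W, CELL_H = 3, 3  # assumption: every glyph occupies a fixed-size cell

# Reverse table: exact bitmap -> character.
BITMAP_TO_CHAR = {bitmap: ch for ch, bitmap in GLYPHS.items()}


def render(text):
    """Render text into rows of pixels by placing glyph cells side by side."""
    return ["".join(GLYPHS[ch][y] for ch in text) for y in range(CELL_H)]


def recognize(rows):
    """Cut the rows into fixed-width cells and look each one up exactly."""
    chars = []
    for x in range(0, len(rows[0]), CELL_W):
        cell = tuple(row[x:x + CELL_W] for row in rows)
        chars.append(BITMAP_TO_CHAR.get(cell, "?"))  # "?" marks an unknown cell
    return "".join(chars)


if __name__ == "__main__":
    line = render("TIL")
    print(recognize(line))
```

Any cell that does not match a known bitmap exactly comes back as `?`, so a mismatch is reported instead of being guessed at.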

The main goal is simple, fast and efficient (in both time and memory) OCR of these screen dumps with 100% accuracy. We tried Tesseract 4/5 with the old legacy engine and the new LSTM models in default mode, with mediocre results (about 60% accuracy). We then trained both models on the custom font, with the images upscaled to 288-384 dpi. Unfortunately the final output is still not good enough: recognition is definitely better, above 90%, but not 100%, and the extra upscaling increases memory use and processing time roughly tenfold.
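
For reference, the roughly tenfold cost is about what the arithmetic predicts. Going from 96 dpi to 288-384 dpi is a 3x to 4x linear upscale, so the pixel count grows 9x to 16x. The sketch below only illustrates that arithmetic and a nearest-neighbour upscale on a toy bitmap. The function names are made up for this example, and it does not reproduce the exact resampling we used.

```python
NATIVE_DPI = 96  # resolution of the original screenshots


def scale_factor(target_dpi, native_dpi=NATIVE_DPI):
    """Linear upscale factor needed to reach target_dpi."""
    return target_dpi / native_dpi


def pixel_growth(target_dpi, native_dpi=NATIVE_DPI):
    """Factor by which the pixel count (and raw bitmap memory) grows."""
    return scale_factor(target_dpi, native_dpi) ** 2


def upscale_nearest(bitmap, factor):
    """Nearest-neighbour upscale of a 2D list of 0/1 pixels by an integer factor."""
    out = []
    for row in bitmap:
        # Repeat every pixel horizontally, then repeat the widened row vertically.
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


if __name__ == "__main__":
    for dpi in (288, 384):
        print(dpi, scale_factor(dpi), pixel_growth(dpi))
```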

Does anyone know whether Tesseract provides a configuration option or feature for this kind of OCR? Or would some other open-source tool fit this task better, such as EasyOCR, OpenCV, OCRopus or GOCR?

--
Thanks,
Vadim.