Training Tesseract for characters in specific environments

42 views
Skip to first unread message

Rockmebabye

unread,
Sep 7, 2017, 2:37:31 AM9/7/17
to tesser...@googlegroups.com
Hi everyone, sorry if this is a silly question -- I've been using Tesseract for a while but never needed to train it so far. I need to OCR a lot of single numbers and single letters that appear alone in small boxes and circles and Tesseract fails a lot at that and manual intervention is too tiresome. I've looked up various guides for training Tesseract but they all want me to build a large TIFF file with walls of text, when I've already got thousands of small TIFF files I cut out. Is there a better way to do this than patching them all back into a huge image for jTessBoxEditor?


Sent with ProtonMail Secure Email.

Quan Nguyen

unread,
Sep 7, 2017, 5:09:18 PM9/7/17
to tesseract-ocr
If you think they are of the same font, you can put them all in a multi-page TIFF.
Reply all
Reply to author
Forward
0 new messages