Tesseract trained to better detect historical texts

Subhashish

9:54 AM
to tesseract-ocr
Hi all,

My name is Subhashish and I am a native speaker of the Odia (Oriya) language. I've started training Tesseract on Chapakala 19, a typeface revival of the widely used 19th-century letterpress font.


More details about the process and results: https://github.com/ofdn/tessdata_contrib/tree/main/ori_hist


Subhashish