Preparation of a specific character-set traineddata

83 views
Skip to first unread message

Karol Wójcik

unread,
Jan 5, 2024, 1:55:53 AM1/5/24
to tesseract-ocr
Hi there, 

So far I've been using https://github.com/Shreeshrii/tessdata_shreetest/blob/master/digits_comma.traineddata. Generally speaking, with very good results, much better than when using eng-best or eng-fast from standard tesseract repo. But, unfortunately, recently I came across some unrecognized characters when ocr-ing my data sets and it seems it's blocking further development of my software.

I tried to fine tune it myself, but unfortunately the results got worse :( So I'm looking for somebody willing to create a specialized traineddata for me. It would require a few additional characters added along to digits_comma.traineddata. I would want to achieve the same accuracy as when using digits_comma.traineddata.  

I'd be more than happy to pay premium for such work.

Best Regards,
Karol

Karol Wójcik

unread,
Jan 15, 2024, 5:16:21 PM1/15/24
to tesseract-ocr
Nobody? :(
Reply all
Reply to author
Forward
0 new messages