Fonts used in training Tesseract 4 eng Model

46 views
Skip to first unread message

Raniem

unread,
Nov 5, 2018, 5:05:37 AM11/5/18
to tesseract-ocr
Hello All

I have been trying to train the eng model from scratch (trying to experiment with different net specs that might be a little bit faster) but was way too far from a good accuracy (except for on training data).
I have seen the fonts list used in the langdata-lstm repository and was wondering if those font are all available somewhere to download or if there is a way to collect them to continue my experiment. 
I know training from scratch is a daunting task but I would like to see if I can replicate at least a close model (accuracy wise) on my own.
I am following the training steps but only need to collect all this fonts, i thought they might already be uploaded somewhere as .rtf files or whatever but need to be guided to that place.

Any ideas please?
Appreciate your time reading my post! Thanks.

Regards
Reply all
Reply to author
Forward
0 new messages