Training Tesseract 3.04 for a custom font using tesstrain.sh

32 views
Skip to first unread message

BLau

unread,
Feb 27, 2019, 11:12:33 AM2/27/19
to tesseract-ocr

Hello,

I was trying to train Tesseract using tesstrain.sh in order to make it learn a custom font, but I'm struggling right now.
I have the right langdata from official repos (with training_text, bigrams, unicharambigs etc...) , I also have the traineddata for the same language wich also comes from official repos.
But I don't understand how to use a custom font, by custom font I mean a font for which I only know the name / have real life examples. 

I've tried a tutorial following which I had to cut out some text pieces then generate .box files out of these. Then I had to correct box files and give it back to tesseract using box.train command.
As this method didn't work for me, I've searched another way to do this and I found a way to train using tesstrain.sh but I'm stuck because I don't know how to give my custom font.


So if someone could explain me how to do it or has good link that explain I'll be thankfull, I've search in lot of different places and I'm missing something to understand how to give a custom font to train  with

I followed this tutorial :
and tried using this script :
Reply all
Reply to author
Forward
0 new messages