How do I train tesseract 4 for the font Comic Sans MS?


rely LIVE

Oct 30, 2018, 12:32:22 PM
to tesseract-ocr
Hello,

I want to train the default eng.traineddata for the font "Comic Sans MS".
Is that possible at all?
Which files do I need, and where do I get them? I have already installed Tesseract 4 on Ubuntu 18.04 and can do simple OCR.
Which commands are necessary for training?

I know from the basic tutorial that I have to use tesstrain.sh and lstmtraining, but the tutorial is too complicated for me to follow.

Thanks in advance and kind regards,
Volker
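
For readers landing on this question: the following is only a rough sketch of the usual fine-tuning sequence built around the two tools named above (tesstrain.sh and lstmtraining), not a verified recipe. It merely assembles and prints the command lines so they can be reviewed before running. All directory names, the output name "comic", and the iteration count of 400 are assumptions for illustration. It also assumes that eng.traineddata comes from the tessdata_best repository (the integer "fast" models cannot be used as a starting point for further training), that the langdata files are available locally, and that the font name matches what `text2image --list_available_fonts --fonts_dir <dir>` reports.

```python
import shlex


def build_finetune_commands(font, lang, fonts_dir, langdata_dir, tessdata_dir, work_dir):
    """Return the fine-tuning steps as a list of argument lists.

    All paths are placeholders supplied by the caller; nothing is executed here.
    """
    train_dir = f"{work_dir}/train"                 # hypothetical output folder
    base = f"{tessdata_dir}/{lang}.traineddata"     # assumed to be a tessdata_best model
    lstm = f"{work_dir}/{lang}.lstm"                # LSTM model extracted from base
    model = f"{work_dir}/comic"                     # hypothetical checkpoint prefix
    return [
        # 1. Render synthetic training lines in the target font.
        ["tesstrain.sh", "--fonts_dir", fonts_dir, "--lang", lang,
         "--linedata_only", "--noextract_font_properties",
         "--langdata_dir", langdata_dir, "--tessdata_dir", tessdata_dir,
         "--fontlist", font, "--output_dir", train_dir],
        # 2. Extract the LSTM model from the existing traineddata.
        ["combine_tessdata", "-e", base, lstm],
        # 3. Continue training from that model on the new line data.
        ["lstmtraining", "--model_output", model, "--continue_from", lstm,
         "--traineddata", base,
         "--train_listfile", f"{train_dir}/{lang}.training_files.txt",
         "--max_iterations", "400"],                # starting value, tune as needed
        # 4. Convert the last checkpoint into a usable traineddata file.
        ["lstmtraining", "--stop_training",
         "--continue_from", f"{model}_checkpoint",
         "--traineddata", base,
         "--model_output", f"{work_dir}/{lang}_comic.traineddata"],
    ]


# Example values only; adjust every path to the local setup.
commands = build_finetune_commands(
    font="Comic Sans MS", lang="eng", fonts_dir="/usr/share/fonts",
    langdata_dir="langdata", tessdata_dir="tessdata_best", work_dir="out",
)
for cmd in commands:
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

The printed lines can be pasted into a shell one at a time. Whether 400 iterations are enough depends on how far the font departs from the ones the base model was trained on, so the error rate reported by lstmtraining is the thing to watch.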

Shree Devi Kumar

Oct 30, 2018, 2:31:17 PM
to tesser...@googlegroups.com
