training handwritten digits

81 views
Skip to first unread message

raymo...@gmail.com

unread,
Aug 10, 2018, 12:43:41 AM8/10/18
to tesseract-ocr
Hi Shree and everyone:

I just noticed that the training process of version 4.00 was updated recently, now I plan to train handwritten digits using version 4.0, 
but before training, I have two questions:

1. is it possible to fine tuning handwritten digits using eng.traineddata, since there is not a model for digits only?
2. how many samples I need to prepare, approximately?
3. Has not GPU been supported during training ?

looking forward to your reply, thanks!

Soumik Ranjan Dasgupta

unread,
Aug 10, 2018, 12:53:31 AM8/10/18
to tesser...@googlegroups.com


On Fri, Aug 10, 2018, 10:13 AM <raymo...@gmail.com> wrote:
Hi Shree and everyone:

I just noticed that the training process of version 4.00 was updated recently, now I plan to train handwritten digits using version 4.0, 
but before training, I have two questions:

1. is it possible to fine tuning handwritten digits using eng.traineddata, since there is not a model for digits only?

1. Yes, its possible, you'd need to change the text corpus to one consisting of digits only. Also, tesseract 4 takes in a fontlist, so try using handwritten fonts for better recognition.   

2. how many samples I need to prepare, approximately?

2. A 200-250 line corpus with 200 - 300 digits per line would suffice. At least, it did for me.

3. Has not GPU been supported during training ?

3. Tesseract 4 does not support GPU acceleration as far as I know.
 
looking forward to your reply, thanks!

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/7291cc77-370b-4a89-a364-7c639d5fb2bc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages