hebrew training data files

220 views
Skip to first unread message

Roi Dayan

unread,
May 18, 2011, 4:00:25 PM5/18/11
to tesser...@googlegroups.com
Now that my B.Sc. project is behind me I can share the tesseract training data I compiled for Hebrew

Links:

training_fonts.zip - the training files I used
tesseract-2.00.heb.tar.gz - compiled for tesseract 2
heb.traineddata.gz - compiled for tesseract 3


Roi Dayan

unread,
May 18, 2011, 3:57:51 PM5/18/11
to tesser...@googlegroups.com, Eyal Cohen, Patrick Questembert
training_fonts.zip – the training files I used
tesseract-2.00.heb.tar.gz – compiled for tesseract 2
heb.traineddata.gz 
– compiled for tesseract 3


--

Roi 


Dmitri Silaev

unread,
May 18, 2011, 11:13:17 PM5/18/11
to tesser...@googlegroups.com, roi....@gmail.com
Great contribution to the Tesseract community, thanks!

Did you do any special pre- or post- processing for Hebrew, and/or set
any config params?
It'd be nice if you share...

Warm regards,
Dmitri Silaev
www.CustomOCR.com

--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Roi Dayan

unread,
May 20, 2011, 1:28:00 AM5/20/11
to tesser...@googlegroups.com, roi....@gmail.com
Hi,

I didn't do anything special, its just collection of the fonts you see see in the trainings fonts archive

In my application itself that I used it with I did some normal image pre processing
like converting to gray scale and sharp

Dmitri Silaev

unread,
May 20, 2011, 8:06:26 AM5/20/11
to tesser...@googlegroups.com, roi....@gmail.com
OK, thanks again for sharing!

--
Dmitri

Reply all
Reply to author
Forward
0 new messages