Which is best eng.traineddata

2,902 views
Skip to first unread message

Deepak C R

unread,
Dec 5, 2017, 3:47:58 AM12/5/17
to tesseract-ocr

I have an eng.traineddata which is having 31mb.

and I have downloaded another trained data from:

https://github.com/tesseract-ocr/tessdata_best/blob/master/eng.traineddata

which in site they have claimed that it as best, but is size is only 15mb.

May I know which is giving good accuracy 31mb eng.traineddata or 15 mb eng.traineddata ??

Wang Zhimin

unread,
Dec 14, 2017, 1:46:03 AM12/14/17
to tesseract-ocr
The default one contains legacy traineddata. Therefore the file size is bigger.

If you compare tessdata_best (15MB) and tessdata_fast (5MB), the int version is much smaller. 
Reply all
Reply to author
Forward
0 new messages