Re: [tesseract-ocr] tessdata_best traineddata FIles

65 views
Skip to first unread message
Message has been deleted

ShreeDevi Kumar

unread,
Feb 1, 2018, 6:07:31 AM2/1/18
to tesser...@googlegroups.com
Latin - for Latin script including languages such as eng, deu, spa etc

lat - for Latin language

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Feb 1, 2018 at 4:31 PM, James Q <james.qu...@taina.tech> wrote:
The following appear to be both Latin, so can anyone tell me what the difference is between:
Latin.traineddata
and:
lat.traineddata
apart from the fact that the first one is 10 times bigger?

Thanks
James

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d8e539ff-f924-4d78-b5f7-dc80d8855342%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Message has been deleted
Message has been deleted

ShreeDevi Kumar

unread,
Feb 1, 2018, 6:39:47 AM2/1/18
to tesser...@googlegroups.com
You are correct. Latin script is available only for LSTM mode --oem 1, with traineddata files in tessdata_best and tessdata_fast.

Similarly for all other script traineddata files - names starting with CAPITAL letters.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Feb 1, 2018 at 4:46 PM, James Q <james.qu...@taina.tech> wrote:

Thanks Shree, So presumably then there is no Latin Script traineddata for Tesseract_Only mode?

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
Reply all
Reply to author
Forward
0 new messages