--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0470b38b-e14c-489b-a853-80599a7f79bc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
which has info specifically for jpn and Japanese.
ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
On Wed, Jan 17, 2018 at 3:10 PM, ShreeDevi Kumar <shree...@gmail.com> wrote:
Initial capitals indicate the one model for all langs in that script, so e.g. Latin covers all Latin-based languages except vie, which has its own Vietnamese model. Most of the script models include English training data as well as the script, but not Cyrillic, as that would have a major ambiguity problem. Devanagari is hin+san+mar+nep+eng, and Fraktur is basically a combination of all the Latin-based languages that have an 'old' variant, etc. I would be interested to hear more feedback on the script models.
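
To make the naming rule above concrete, here is a minimal sketch in Python that sorts traineddata file names into script models and language models purely by the initial capital. The function name `model_kind` and the constant `DEVANAGARI_LANGS` are made up for illustration and are not part of Tesseract; the only facts used are the capitalization convention and the Devanagari composition quoted above.

```python
from pathlib import PurePosixPath


def model_kind(traineddata_name: str) -> str:
    """Classify a traineddata file by the naming convention described above.

    Hypothetical helper: an initial capital (Japanese, Latin) is taken to mean
    a script model, a lowercase code (jpn, eng) a single-language model.
    """
    # Drop any directory part and the ".traineddata" suffix.
    stem = PurePosixPath(traineddata_name).name.split(".")[0]
    return "script" if stem[:1].isupper() else "language"


# Composition of the Devanagari script model as stated in the message;
# the compositions of other script models are not listed there.
DEVANAGARI_LANGS = ["hin", "san", "mar", "nep", "eng"]

if __name__ == "__main__":
    for name in ["jpn.traineddata", "Japanese.traineddata",
                 "vie.traineddata", "Vietnamese.traineddata",
                 "Latin.traineddata"]:
        print(f"{name}: {model_kind(name)} model")
```

This only inspects the file name. It does not load the model or say anything about recognition quality, so comparing jpn against Japanese still means running both on the same images.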
ShreeDevi
On Wed, Jan 17, 2018 at 10:10 AM, Moriya Takasi <tak...@moriyan.jp> wrote:
Hello, the trained data repository tessdata-fast has two trained data files for the same language, such as eng.traineddata and English.traineddata. I could not find an explanation of the difference. In the case of Japanese, the difference in results between Japanese.traineddata and jpn.traineddata was often not small. Where should I look for a description of it?