On 30 June 2015 at 11:46, Kus WikzSL <
spkm...@gmail.com> wrote:
> Hi All,
> I am currently doing my undergraduate project. It include a OCR part
> for "SInhala" language (primary language of sri lanka).
> I hope to doing using teseract. But the problem is there is no train data
> for sinhala language. Can any one help me to describe how to train for a
> new language.
I think 'sin' is Sinhala:
https://github.com/tesseract-ocr/tessdata/blob/master/sin.traineddata?raw=true
(it was added a few days ago).
--
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you