$ tesseract --version
tesseract 4.0.0-beta.3
leptonica-1.76.0
libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11
Found AVX2
Found AVX
Found SSE
tesseract -l lat --oem 1 --psm 3
drstva devam mahakalam kalikangam mahaprabhum |bhargavah patito bhumau dandavatsurapujite ||bhargava uvacakalyantakalagnisamanabhasamcaturbhujam kalikayopajustam |kapalakhatvangavarabhayadhya-karam mahakalamanantamide ||namah paramarüpaya paramalasurupine |niyatipraptadehaya tattvarupaya te namah ||namah paramarüpaya paramarthaikarupine |viyanmayasvarupaya bhairavaya namo.astute ||OM namah parameésaya paratattvarthadaráine |viyanmayadyadhisaya dhivicitraya $ambhave ||triloke$aya güdhaya suksmayavyaktarupine |parakasthadirupaya paraya $ambhave namah ||OM namah kalikankaya kalatjananibhaya te |jagatsamharakartre ca mahakalaya te namah ||nama ugraya devaya bhimaya bhayadayine |mahabhayavinasaya srstisamharakarine ||namah paraparanandasvarupaya mahatmane |paraprakasarüpaya praka$anam praka$sine ||OM namo dhyanagamyaya yogihrtpadmavasine |vedatantrarthagamyaya vedatantrarthadarsine ||vedagamaparamar$aparamanandadayine |tantravedantavedyaya $ambhave vibhave namah ||dhiyam pracodakam yattu paramam jyotiruttamam |tatprerakaya devaya paramajyotise namah ||gunaérayaya devaya nirgunaya kapardine |atisthulaya devaya hyatisuksmaya te namah ||trigunaya tryadhisaya saktitritayasaline |namastrijyotise tubhyam tryaksaya ca trimürtaye ||
You're telling tesseract that your text is in Latin. You need the traineddata for san-lat.
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d2fc7942-16a2-48f0-9651-920616179d54%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You can try IAST ones from https://github.com/Shreeshrii/tessdata_shreetest?files=1
On Fri 27 Jul, 2018, 8:27 AM Shree Devi Kumar, <shree...@gmail.com> wrote:
There is no official traineddata for san_latn or last. I have created some experimental versions but the output is not fully accurate.
On Fri 27 Jul, 2018, 12:21 AM John Muccigrosso, <jmuc...@gmail.com> wrote:
You're telling tesseract that your text is in Latin. You need the traineddata for san-lat.--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesser...@googlegroups.com.
Please try the models from https://github.com/Shreeshrii/tesstrain-Sanskrit-IAST