Tesseract for Android for Hindi language

241 views
Skip to first unread message

Harshit Dohare

unread,
Feb 27, 2018, 3:32:33 PM2/27/18
to tesseract-ocr
Hello Everyone,

I am new to tesseract and I am trying to build android app for Hindi OCR using tess-two on Android Studio.

I am able to make the android app work for English and few other languages. However when I tried with Hindi, the app crashes. I  can't figure out the error. 
I have learnt that for Hindi and Arabic, I need to use .cube files also. However putting .cube files with .traineddata file does not help and the android app still crashes. What do I have to do to use these .cube files in Android Studio?

Can somebody help me in this or provide some useful links for something related to this?

Thanks
Harshit

ShreeDevi Kumar

unread,
Feb 27, 2018, 9:21:38 PM2/27/18
to tesser...@googlegroups.com
>  Hindi OCR using tess-two on Android Studio.

Probably uses old version of tesseract and traineddata.

For Hindi, you will get best result with tesseract (version 4.00alpha) and traineddata files from tessdata_fast

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/22b32f68-eff0-4bbb-b0b1-737b0e5deb34%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Harshit Dohare

unread,
Feb 28, 2018, 2:04:27 AM2/28/18
to tesseract-ocr
Thanks for the reply.

Since I am working to build android app using Android Studio, I am using tess-two and I have found that tess-two works with Tesseract version 3 as of now. So I can't use Tesseract version 4 and new trained files.
Any other idea how can I go ahead?

Thanks,
Harshit

ShreeDevi Kumar

unread,
Feb 28, 2018, 2:19:18 AM2/28/18
to tesser...@googlegroups.com
Please see https://github.com/tesseract-ocr/tesseract/issues/875#issuecomment-369143904

You maybe able to build tesseract4 to use with tess-two using the suggestion in that thread.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

ada...@turningcloud.com

unread,
Feb 28, 2018, 2:25:06 AM2/28/18
to tesseract-ocr
@shree please look into my post also. Its kind of urgent. Hope that you reply.


On Wednesday, February 28, 2018 at 12:49:18 PM UTC+5:30, shree wrote:
Please see https://github.com/tesseract-ocr/tesseract/issues/875#issuecomment-369143904

You maybe able to build tesseract4 to use with tess-two using the suggestion in that thread.
On 28-Feb-2018 12:34 PM, "Harshit Dohare" <harshi...@gmail.com> wrote:
Thanks for the reply.

Since I am working to build android app using Android Studio, I am using tess-two and I have found that tess-two works with Tesseract version 3 as of now. So I can't use Tesseract version 4 and new trained files.
Any other idea how can I go ahead?

Thanks,
Harshit

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages