Where can I find the original Chinese training data to retrain Tesseract 4.0?

5143...@qq.com

Aug 18, 2017, 3:35:06 AM
to tesseract-ocr
Hi all,

I want to retrain Tesseract 4.0 for Chinese, but the training data I found at https://github.com/tesseract-ocr/langdata appears to be for Tesseract 3.0 only.

I would appreciate any help.

ShreeDevi Kumar

Aug 18, 2017, 4:33:16 AM
to tesser...@googlegroups.com
langdata has NOT been updated for 4.0.

Please wait for update from Ray.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/054046ac-8ff0-44fa-9361-12711de7fbf8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

5143...@qq.com

Aug 18, 2017, 4:44:17 AM
to tesseract-ocr
Who is Ray? How can I contact him?

ShreeDevi Kumar

Aug 18, 2017, 4:50:53 AM
to tesser...@googlegroups.com
The lead developer of tesseract-ocr is Ray Smith (at Google). He is @theraysmith on GitHub.

He is in the process of updating the files for the 4.0.0 beta release, which is expected soon.


ShreeDevi


5143...@qq.com

Aug 18, 2017, 4:59:12 AM
to tesseract-ocr

Thanks, that is really good news.
