How to train two different languages using Tesseract 4.0

易鑫

Feb 27, 2019, 3:03:17 AM
to tesseract-ocr
Hello everyone,

I want to recognize the text in some images, but the images contain more than one language: both English and Chinese. I want to recognize the two simultaneously. In that case, do I need to train a single model that covers both English and Chinese?

If so, how do I train a model on two languages? Thanks in advance.

Sorry for my poor English.


Zdenko Podobny

Feb 27, 2019, 3:21:02 AM
to tesser...@googlegroups.com
Why do you think you need to run training?
Does eng+chi_sim not work?

Zdenko
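
For readers following along, here is a minimal sketch of what the eng+chi_sim suggestion above looks like in practice. It assumes the tesseract command-line binary is on the PATH and that both eng.traineddata and chi_sim.traineddata are installed. The helper names and the image file name are hypothetical and chosen only for illustration.

```python
import subprocess


def build_tesseract_command(image_path, languages=("eng", "chi_sim")):
    """Build a tesseract CLI call that loads several languages at once.

    Hypothetical helper: languages are joined with "+" as the -l option expects.
    """
    lang_arg = "+".join(languages)
    # Passing "stdout" as the output base makes tesseract print the
    # recognized text instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang_arg]


def recognize(image_path, languages=("eng", "chi_sim")):
    """Run tesseract and return the recognized text.

    Assumes the tesseract binary and the requested traineddata files
    are installed; raises CalledProcessError if tesseract fails.
    """
    result = subprocess.run(
        build_tesseract_command(image_path, languages),
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8")


# Example call (the file name is a placeholder):
# print(recognize("mixed_text.png"))
```

Trying this on a few sample images first shows whether the stock models are already accurate enough before any training is attempted.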



易鑫

Feb 27, 2019, 4:08:14 AM
to tesseract-ocr
There are only a few Chinese characters and part of the English letters, so if I want high accuracy, maybe retraining the model is a better choice.
