Tesseract 2 will be rubbish for Chinese. Tesseract 3 has specific
support for Chinese/Japanese/Korean.
> I read the introduction at http://code.google.com/p/tesseract-ocr/w/list,
> but when I did my training I ran into some problems. Here are the steps I
> did:
>
> 1.tesseract 1.tif 1 batch.nochop makebox--------------make a txt file
> 2.Rename 1.txt to 1.box, then use bbtesseract to adjust it.
> 3.Tesseract 1.tif junk nobatch box.train --------make 1.tr and
> junk.txt
> 4.mftraining scan.tr
> 5.cnTraining scan.tr
> 6.unicharset_extractor scan.box
>
> Ok, there are inttemp / normproto / pffmtable / unicharset, but how do I
> use them?
> Did I do something wrong?
>
Err... you'd have to read further in the training document, where
that's explained.
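
As a rough sketch of what that later part describes for Tesseract 3, as far
as I understand it: the output files get a language prefix, are merged with
combine_tessdata, and the result goes into the tessdata directory. The
language code "chi", the helper name prefix_training_files, and the
TESSDATA_DIR variable below are placeholders for illustration, not anything
from the training document, so check the exact steps there before relying on
this.

```shell
#!/bin/sh
# Sketch only: give the training outputs the "<lang>." prefix.
# prefix_training_files is a hypothetical helper, not a Tesseract tool.
prefix_training_files() {
    lang_code="$1"   # placeholder language code, e.g. chi
    work_dir="$2"    # directory holding the training outputs
    for name in inttemp normproto pffmtable unicharset; do
        # Rename only the files that are actually present.
        if [ -f "$work_dir/$name" ]; then
            mv "$work_dir/$name" "$work_dir/$lang_code.$name"
        fi
    done
}

# Assumed follow-up steps, left commented out because they need a
# Tesseract 3 install and depend on where tessdata lives on your system:
#   prefix_training_files chi .
#   combine_tessdata chi.            # should write chi.traineddata
#   cp chi.traineddata "$TESSDATA_DIR"/
#   tesseract 1.tif out -l chi
```

The rename is kept in a function so it can be tried on a scratch directory
before touching the real training output.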
> Thanks a lot!
>
--
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.