I am trying to train Tesseract for Urdu Nastaleeq fonts. I used 10 text files totaling 1 MB and gave them to the jTesseract editor to create box files and then a traineddata file, but it gives this error: "Error: unichar بجا in normproto file is not in unichar set". The resulting output is also very inaccurate. Can somebody help me with this?
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3106acc3-fb3f-4816-9a07-a3a31b79c66a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.