Status: New
Owner: ----
New issue 1256 by
chemm...@gmail.com: Pinyin diacritics
http://code.google.com/p/tesseract-ocr/issues/detail?id=1256
Hi !
I need to OCR texts containing pinyin diacritics ( o ā ɑ̄ ē ī ō ū ǖ / Ā Ē Ī
Ō Ū Ǖ /á ɑ́ é í ó ú ǘ / Á É Í Ó Ú Ǘ / ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / Ǎ Ě Ǐ Ǒ Ǔ Ǚ / à ɑ̀ è
ì ò ù ǜ / À È Ì Ò Ù Ǜ / a ɑ e i o u ü / A E I O U o ā ɑ̄ ē ī ō ū ǖ / á ɑ́ é
í ó ú ǘ /ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / à ɑ̀ è ì ò ù ǜ / a ɑ e i o u ü) which the
software either does not recognize or even mix up.
It'd be great also to recognize shorthand drawings.
I've tried several softwares, training, creating user languages and adding
them to their dictionaries etc, finding no success at all.
I would really appreciate some advice on how to solve this problem, if
possible, as soon as possible so that I can decide whether to acquire it or
not.
Thanks in advance!
--
You received this message because this project is configured to send all
issue notifications to this address.
You may adjust your notification preferences at:
https://code.google.com/hosting/settings