How to set whitelist for non-English character

58 views
Skip to first unread message

un C

unread,
Jul 29, 2020, 5:39:39 AM7/29/20
to tesseract-ocr
Hi, I am using tesseract v5.0.0-alpha.20200328.

When I tried 'whitelist=0123456789abcd', it does work.

However, when it comes to Chinese...
Neither '-c tessedit_char_whitelist=我愛你'  nor
'-c tessedit_char_whitelist=\u6211\u611b\u4f60' worked.

Can anyone give me a hint? Thanks a lot.
Reply all
Reply to author
Forward
0 new messages