tesseract with no language words


steven lee

Jun 14, 2016, 4:18:26 AM
to tesseract-ocr
Hi,
I need to build an OCR tool that scans cipher text. Since cipher text is not a natural language, what should I pass to the language option?

Stef

Jun 16, 2016, 11:09:42 AM
to tesseract-ocr
If the cipher text uses Latin characters, I'd use -l eng, switch off the English word lists (load_system_dawg=0 and load_freq_dawg=0), and set tessedit_char_whitelist to the characters that can occur in your cipher text.
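
For illustration, here is a rough Python sketch of how those settings could be passed on the command line. It assumes the tesseract binary is on your PATH and that your version accepts config variables through the -c option; the function names, file names, and the uppercase-only whitelist are placeholders I made up, not part of Tesseract.

```python
import subprocess


def build_tesseract_command(image_path, output_base, whitelist):
    """Build the argument list for a tesseract run with the word lists off.

    image_path, output_base and whitelist are hypothetical example inputs.
    """
    return [
        "tesseract", image_path, output_base,
        "-l", "eng",                    # Latin character shapes from the English data
        "-c", "load_system_dawg=0",     # do not load the main dictionary word list
        "-c", "load_freq_dawg=0",       # do not load the frequent-words list
        "-c", f"tessedit_char_whitelist={whitelist}",  # allow only these characters
    ]


def run_ocr(image_path, output_base, whitelist):
    """Run tesseract; raises CalledProcessError if the command fails."""
    subprocess.run(
        build_tesseract_command(image_path, output_base, whitelist),
        check=True,
    )
```

Calling run_ocr("cipher.png", "cipher_out", "ABCDEFGHIJKLMNOPQRSTUVWXYZ") should then write the recognized text to cipher_out.txt. Adjust the whitelist to whatever alphabet your cipher actually uses.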

Stef
