tesseract for japanese

169 views

Skip to first unread message

unread,

Mar 23, 2016, 3:14:39 AM3/23/16

to tesseract-ocr

Hi,

I have used the following code snipet:

text = textract.process(
    'path/to/norwegian.pdf',
    method='tesseract',
    language='jpn',
)

but i am still seeing gibberish text when i print it.. I have also tries it with encoding utf_8 and utf_16.. basic Japanese text formats, no luck..

please help..

Reply all

Reply to author

Forward

0 new messages