Tesseract for Japanese

jay nirgudkar

Mar 23, 2016, 3:14:39 AM
to tesseract-ocr
Hi, 

I have used the following code snippet:
import textract

# Run OCR on the PDF with the Tesseract backend and the Japanese language pack
text = textract.process(
    'path/to/norwegian.pdf',
    method='tesseract',
    language='jpn',
)
but I am still seeing gibberish text when I print it. I have also tried it with the utf_8 and utf_16 encodings, the basic Japanese text formats, with no luck.
Please help.
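
For reference, here is a minimal sketch of the decoding step mentioned above. It assumes the OCR call hands back UTF-8 encoded bytes rather than a str, which is one possible reason the printed result looks like gibberish. The helper name decode_ocr_output and the sample bytes are hypothetical stand-ins, not part of textract or Tesseract.

```python
def decode_ocr_output(raw, encoding="utf-8"):
    """Return OCR output as str, decoding only if it is still bytes."""
    if isinstance(raw, bytes):
        return raw.decode(encoding)
    return raw


# Hypothetical stand-in for the bytes an OCR call might return.
sample = "日本語のテキスト".encode("utf-8")

# Printing the raw bytes shows escape sequences, not Japanese characters.
print(sample)

# Decoding with the matching codec recovers the original text.
text = decode_ocr_output(sample)
print(text == "日本語のテキスト")
```

If the decoded text is still unreadable, the bytes are probably not the issue, and the OCR step itself (for example, whether the Japanese trained data is being used) would be the next thing to check.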