preserve_interword_spaces not working in tesseract 4.00.00alpha

Kazi Moinul Hossain

Mar 30, 2017, 6:30:12 AM

to tesseract-ocr

Hello!

I am using tesseract 4.00.00alpha. I tried to run OCR on the attached TIFF image with the following command:


tesseract -l eng -c preserve_interword_spaces=1 170624018949_normal_2.tif output-preserve-spaces

However, the resulting text ignores all the additional spaces in the image. The output is attached as output-preserve-spaces.txt. I found no significant difference between it and the default output (normal.txt), which I produced with:

tesseract 170624018949_normal_2.tif normal
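
A minimal sketch for checking whether the two output files really differ in spacing, assuming a POSIX shell with grep and cmp available. The helper names count_wide_gaps and same_output are made up for illustration and are not part of tesseract.

```shell
#!/bin/sh
# Hypothetical helper (not part of tesseract): print the number of lines
# in a text file that contain a run of two or more consecutive spaces.
# grep -c exits non-zero when the count is 0, so "|| true" keeps the
# helper usable under "set -e".
count_wide_gaps() {
    grep -c '  ' "$1" || true
}

# Hypothetical helper: exit 0 when the two files are byte-identical,
# non-zero otherwise. cmp -s compares silently.
same_output() {
    cmp -s "$1" "$2"
}

# Usage with the files named in this post, after running both commands:
#   count_wide_gaps normal.txt
#   count_wide_gaps output-preserve-spaces.txt
#   same_output normal.txt output-preserve-spaces.txt && echo "identical"
```

If preserve_interword_spaces were taking effect, I would expect the second count to be higher than the first and the two files not to be identical.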

I don't understand what the problem is: does tesseract 4.00.00alpha not support the "preserve_interword_spaces" option, or is some additional configuration required during tesseract installation?

Thanks!

normal.txt

output-preserve-spaces.txt

170624018949_normal_2.tif
