preserve_interword_spaces not working in tesseract 4.00.00alpha

63 views
Skip to first unread message

Kazi Moinul Hossain

unread,
Mar 30, 2017, 6:30:12 AM3/30/17
to tesseract-ocr
Hello!

I am using tesseract 4.00.00alpha. I was trying to do OCR on the attached tif image with the following command.


tesseract -l eng -c preserve_interword_spaces=1
170624018949_normal_2.tif outout-preserve-spaces





But in return, i am getting text ignoring all the additional spaces in the image. The output is attached is output-preserve-spaces.txt. There is no significant difference found in the output from the default output(normal.txt).

tesseract 170624018949_normal_2.tif normal


I actually dont get problem if tesseract 4.00.00alpha doesn't allow "preserve_interword_spaces" option to use or is there any additional configuration required during tesseract installation.

Thanks!

normal.txt
output-preserve-spaces.txt
170624018949_normal_2.tif
Reply all
Reply to author
Forward
0 new messages