Tesseract recognizes the characters irrespective of the lines

78 views
Skip to first unread message

Dineshkumar

unread,
Sep 9, 2014, 9:28:15 AM9/9/14
to tesser...@googlegroups.com
What steps will reproduce the problem?
1. Run the Tesseract OCR in Java for the attached image 
2. Save the OCR result in a text file
3. Check the order of the output text file with the attached image.

What is the expected output? What do you see instead?
Expected output -- Expected the result with words in the horizontal left to right order.

Actual output   -- Showing words randomly irrespective of the line order.

What version of the product are you using? On what operating system?
Tesseract 3.01 and Windows 7 

Please provide any additional information below.
The input and expected & actual output are attached for reference.
expected.txt
actual.txt

Satya Swaroop

unread,
Sep 19, 2014, 8:01:41 AM9/19/14
to tesser...@googlegroups.com
I am also facing the same problem.Please post your answer once you find it.

Thanks in advance
Reply all
Reply to author
Forward
0 new messages