I am using Tesseract Version 3.02 for development in Windows. I am running OCR on an image and I get different results from the Tesseract API I write in my development environment and by executing Tesseract from command line.
Sample output from command line (which is the right one):
“t3DMarkLogic"
Tony Agresta
Worldwide VP, Field Operations
MarkLogic Corporation -
Sample output from the Tesseract API code (contains a lot of junk characters):
_ - _ _ V, f’ ._,«.-f A‘ . V ,//7-fir /1”" /’..i../¢_'7*"'
ca : .- ' 8 4
‘D MarkLogic" 1
Tony Agresta . 5
Worldwide VP, Field Operations 1
* MarkLogic Corporation - V
Why are the two outputs different? And what should I change so that the output from my code matches the output produced for the same image from the command line?