Different OCR outputs from command line and code


Amey Dharwadker


Jun 13, 2013, 3:10:44 PM

to tesser...@googlegroups.com

I am using Tesseract 3.02 for development on Windows. When I run OCR on an image, I get different results from code that calls the Tesseract API in my development environment than from running Tesseract on the command line.

Sample output from command line (which is the right one):

“t3DMarkLogic"

Tony Agresta

Worldwide VP, Field Operations

MarkLogic Corporation -

7950 Jones Branch Drive +1 703 854 8531 Phone

Suite 200 +1 703 854 8510 Fax

McLean, VA 22107 +1 443 253 6810 Mobile

- www.marklogic.com tony.agresta[at]marklogic.com

Sample output from the Tesseract API code (contains a lot of junk characters):

_ - _ _ V, f’ ._,«.-f A‘ . V ,//7-ﬁr /1”" /’..i../¢_'7*"'

ca : .- ' 8 4

‘D MarkLogic" 1

Tony Agresta . 5

Worldwide VP, Field Operations 1

* MarkLogic Corporation - V

’ - 7950 Jones Branch Drive +1 703 854 8531 Phone

Suite 200 +1 703 854 8510 Fax

McLean, VA 22107 +1 443 253 6810 Mobile

- www.marklogic.com tony.agresta[at]marklogic.com

Why are the two outputs different? And what should I change so that the output from my code matches the output produced for the same image from the command line?

zdenko podobny


Jun 14, 2013, 2:10:28 AM

to tesser...@googlegroups.com

The tesseract command-line tool is a short program that uses the tesseract-ocr API, so look for differences between your use of the API and what the tesseract executable does.

Zdenko
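One way to start narrowing this down is to diff the two text outputs and see exactly which lines only the API run produces. The following is a minimal sketch in Python using only the standard library. It does not call Tesseract at all, and the function name `extra_lines` and the sample strings are assumptions made for illustration. Where the extra text appears (top of the page, line ends, margins) may hint at which setting to compare next, for example the page segmentation mode, the language, or how the image is loaded. Which of these applies here is not established in this thread.

```python
import difflib
from typing import List


def extra_lines(cli_text: str, api_text: str) -> List[str]:
    """Return the lines that appear only in the API output.

    Blank lines are dropped and surrounding whitespace is stripped
    before the comparison, so layout-only differences are ignored.
    """
    cli_lines = [ln.strip() for ln in cli_text.splitlines() if ln.strip()]
    api_lines = [ln.strip() for ln in api_text.splitlines() if ln.strip()]
    # ndiff prefixes lines found only in the second sequence with "+ ".
    diff = difflib.ndiff(cli_lines, api_lines)
    return [entry[2:] for entry in diff if entry.startswith("+ ")]


if __name__ == "__main__":
    # Hypothetical excerpts modelled on the two outputs quoted above.
    cli_output = "Tony Agresta\nSuite 200 +1 703 854 8510 Fax\n"
    api_output = "ca : .- ' 8 4\nTony Agresta . 5\nSuite 200 +1 703 854 8510 Fax\n"
    for line in extra_lines(cli_output, api_output):
        print("only in API output:", line)
```

Saving each output to a file and passing the file contents to `extra_lines` would work the same way.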

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

