Tesseract doesn't work with a very simple example

1,460 views
Skip to first unread message

Felipe Coutinho

unread,
Jun 17, 2011, 10:05:26 AM6/17/11
to tesser...@googlegroups.com
Hello,

I'm a new tess user. I'm trying to test the tess with this very simple text image and I didn't succeed. This was the text that was recognized: vefysiqxe. I used this command line: tesseract C:\Users\felipelc\Desktop\simple.tif C:\Users\felipelc\Desktop\out
What am I doing wrong?

Regards,

Felipe.

--
Felipe Leal Coutinho
http://www.felipelc.com/
http://www.facebook.com/felipelcoutinho
http://twitter.com/felipelcout

Softaware Soluções em Informática
http://www.softaware.emp.br

simple.tif

patrickq

unread,
Jun 17, 2011, 10:34:45 AM6/17/11
to tesseract-ocr
I don't think you are doing anything wrong - I tested this with
ScanBizCards (Tesseract 3.01) and I get Very mpxe (note the same
mistake of x instead of l). I think this is yet another example of
Tesseract's poor recognition whenever it has either too little
information about height (as this case, with just two words) or
different font sizes on a page (this is the case that trips Tesseract
most often).

Patrick

On Jun 17, 10:05 am, Felipe Coutinho <felipelcouti...@gmail.com>
wrote:
> Hello,
>
> I'm a new tess user. I'm trying to test the tess with this very simple text
> image and I didn't succeed. This was the text that was recognized: *
> vefysiqxe*. I used this command line: *tesseract
> C:\Users\felipelc\Desktop\simple.tif C:\Users\felipelc\Desktop\out*
> What am I doing wrong?
>
> Regards,
>
> Felipe.
>
> --
> Felipe Leal Coutinhohttp://www.felipelc.com/http://www.facebook.com/felipelcoutinhohttp://twitter.com/felipelcout
>
> Softaware Soluções em Informáticahttp://www.softaware.emp.br
>
>  simple.tif
> 1KViewDownload

zdenko podobny

unread,
Jun 17, 2011, 10:34:42 AM6/17/11
to tesser...@googlegroups.com
First of all - please read documentation e.g. [1]. It can save your time ;-).


--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Quan Nguyen

unread,
Jun 18, 2011, 9:41:34 AM6/18/11
to tesseract-ocr
The resolution of your image is too low -- at 96 DPI, any OCR engine
would have problem with it. After rescaling to 300 DPI, Tesseract was
able to recognize it.

On Jun 17, 9:05 am, Felipe Coutinho <felipelcouti...@gmail.com> wrote:
> Hello,
>
> I'm a new tess user. I'm trying to test the tess with this very simple text
> image and I didn't succeed. This was the text that was recognized: *
> vefysiqxe*. I used this command line: *tesseract
> C:\Users\felipelc\Desktop\simple.tif C:\Users\felipelc\Desktop\out*
> What am I doing wrong?
>
> Regards,
>
> Felipe.
>
> --
Reply all
Reply to author
Forward
0 new messages