Awful results within a block text

135 views
Skip to first unread message

Kathrin- Jennifer

unread,
Jan 13, 2015, 3:50:50 AM1/13/15
to tesser...@googlegroups.com
Hi,

I'm currently playing around with Tesseract since the current results I get are not satisfying. I can't figure out what I'm doing wrong. I'm using Tesseract 3.03 and Leptonica 1.71 (OS MAC).
TESSDATA_PREFIX is set to the installation dir of Tesseract.

The input file is attached. The other files are set as follows:

eng.user-words

the
quick
brown
fox
jumped

config

user_words_suffix user-words

The terminal command I use:
tesseract phototest.png output -l eng config

And this is the awesome output....

Thu ,5 2. m of 12 Liam! an to test the
air (an: and 522 yr n works an a“ lying;
of rm mm.

the Quxzk brown dog name over mg
my m. m muzk brown dog mmaen
over 2»: my m. m muzk brawn daa
mwen aver mg my m. m muzk
brawn my mum aver mg my fax.

I also tried every - psm configuration, but none of the results was close to be useful. Any hints and ideas? Thanks in advance!
phototest.png

Allistair

unread,
Jan 13, 2015, 6:09:28 AM1/13/15
to tesser...@googlegroups.com
It's too small. Try resizing your image to 1000px wide and it works perfectly.

Before resize:


Thu ,5 2. m of 12 Liam! an to test the
air (an: and 522 yr n works an a“ lying;
of rm mm.

the Quxzk brown dog name over mg
my m. m muzk brown dog mmaen
over 2»: my m. m muzk brawn daa
mwen aver mg my m. m muzk
brawn my mum aver mg my fax.

After resize to 1000px wide:

This is a lot of 12 point text to test the
ocr code and see if it works on all types
of file format.

The quick brown dog jumped over the
lazy fox. The quick brown dog jumped
over the lazy fox. The quick brown dog
jumped over the lazy fox. The quick
brown dog jumped over the lazy fox.


Cheers

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a15b5534-567f-4598-80ca-02a7a62fa602%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Vasin Soparkdithapong

unread,
Jan 13, 2015, 4:55:42 PM1/13/15
to tesser...@googlegroups.com
Hi Allistair,

Would you happen to know what the recommended (ideal) image size is for Tesseract 3.03? Is the suggested resolution still 300 dpi?

Thanks!
Vasin

Allistair C

unread,
Jan 14, 2015, 3:24:43 AM1/14/15
to tesser...@googlegroups.com
See my reply to the other post yesterday on detail from the faq. It applies to you.

Sent from my iPhone
Reply all
Reply to author
Forward
0 new messages