no output for a simple test image

40 views
Skip to first unread message

Jeremy Colton

unread,
Aug 19, 2016, 2:00:58 PM8/19/16
to tesseract-ocr
Hi,

i have the latest tesseract-ocr:

tesseract 3.03
 leptonica-1.70
  libgif 4.1.6(?) : libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 : webp 0.4.0

I created an image in paint with pixels 12px and 20px numbers.  See attached.

I ran tesseract a.png out digits

And I get an empty out.txt file.

This is the simplest test and it fails!  What simple config setup have I missed?

Many thanks
Jeremy
b.png

Allistair

unread,
Aug 19, 2016, 2:32:57 PM8/19/16
to tesser...@googlegroups.com
Your image is not very "linear" in the sense that a word document is and which Tesseract by default does best with. You should try another page segmentation. I tried -psm 6 and got 

l

6

3

s

2

‘ 7

9 5

Note this is not 100% accurate so you might then try increasing the resolution of your image.

New users make all the same mistakes - wrong PSM and wrong resolution - this is mentioned in the Tesseract documentation.

https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/6b1a6ddf-c863-4a2e-b077-e7f76760f6cf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages