sometimes getting full resolution as word rect in tsv

19 views
Skip to first unread message

mickey...@gmail.com

unread,
May 6, 2019, 1:02:46 AM5/6/19
to tesseract-ocr

Hello I'm doing a simple command like this:

tesseract thumb0546.jpg outputbase tsv

The issue is that for one of the words, the letter 'a' it's giving me the full image size as the rect containing the word.

5 1 1 1 3 2 0 0 640 360 96 a

I'm using  OS X.  Here's the version info.  Image and full tsv attached. Anyone know how to fix this? 

tesseract -v
tesseract 4.0.0
 leptonica-1.77.0
  libgif 5.1.4 : libjpeg 9c : libpng 1.6.36 : libtiff 4.0.10 : zlib 1.2.11 : libwebp 1.0.2 : libopenjp2 2.3.0
 Found AVX2
 Found AVX
 Found SSE

thumb0546.jpg
outputbase.tsv

Zdenko Podobny

unread,
May 6, 2019, 2:24:26 AM5/6/19
to tesser...@googlegroups.com
Use the latest code - it should be fixed.

Zdenko


po 6. 5. 2019 o 7:02 <mickey...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/07fb0fe0-344c-4a18-a32a-70b0fb815421%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages