Trying to extract the text from image and tesseract is no returning text correctly.

64 views
Skip to first unread message

Durai K

unread,
Jan 1, 2020, 3:16:06 AM1/1/20
to tesseract-ocr
Hi,

I have following tesseract  version installed on Windows 10

tesseract v5.0.0-alpha.20191030
 leptonica-1.78.0
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
 Found AVX2
 Found AVX
 Found FMA
 Found SSE
 Found libarchive 3.3.2 zlib/1.2.11 liblzma/5.2.3 bz2lib/1.0.6 liblz4/1.7.5

I am trying to extract the text from attached image. But it does not work and returns following error

Warning: Invalid resolution 0 dpi. Using 70 instead.
Estimating resolution as 450
Empty page!!
Estimating resolution as 450
Empty page!!

Also it generated attached tiff file when I ran tesseract command as (tesseract captcha5.png stdout -c tessedit_write_images=true)

Can someone please advise how we can extract the text here?.

Regards,
Durai.

captcha5.png
tessinput.tif

Zdenko Podobny

unread,
Jan 1, 2020, 10:25:13 AM1/1/20
to tesser...@googlegroups.com
Search internet (at the least this forum) for tesseract and captcha

Zdenko


st 1. 1. 2020 o 9:16 Durai K <durai.ka...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/bf985f87-0648-438c-9c7f-0e7583678eca%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages