Getting a blank tessinput.tif file


Ashish Goel

Jun 6, 2016, 7:08:11 AM
to tesseract-ocr
Hello All,

I am trying to do OCR on a bunch of images. Some of them fail, and I want to analyse the failures.
To do that, I am trying to get the tessinput.tif file so that I can see what input actually goes to tesseract.

I am passing "-c tessedit_write_images 1" to tesseract to generate the tessinput.tif file.
Tesseract does generate the tessinput.tif file, but the file is blank (0 bytes).
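
For anyone reproducing this, here is a minimal Python sketch that runs tesseract with the parameter and checks whether tessinput.tif came out empty. It assumes the `-c VAR=VALUE` spelling from the tesseract command-line help (`tessedit_write_images=1`) and that tessinput.tif is written to the current working directory. The helper names are made up for illustration.

```python
import subprocess
import tempfile
from pathlib import Path


def build_command(image_path, output_base="out"):
    """Return the tesseract argv that asks for the debug image to be written."""
    return [
        "tesseract",
        str(image_path),
        output_base,
        # Assumption: the documented VAR=VALUE form of the -c option.
        "-c", "tessedit_write_images=1",
    ]


def describe_debug_image(workdir):
    """Classify tessinput.tif in workdir as 'missing', 'empty' or 'ok'."""
    path = Path(workdir) / "tessinput.tif"
    if not path.exists():
        return "missing"
    return "empty" if path.stat().st_size == 0 else "ok"


def run_and_check(image_path):
    """Run tesseract in a scratch directory and report on tessinput.tif."""
    with tempfile.TemporaryDirectory() as workdir:
        # Assumption: tessinput.tif lands in the current working directory.
        result = subprocess.run(
            build_command(Path(image_path).resolve()),
            cwd=workdir,
            capture_output=True,
            text=True,
        )
        # Any image-writing errors would be expected on stderr.
        return describe_debug_image(workdir), result.stderr
```

Calling `run_and_check("page.png")` (with a real image in place of the placeholder name) returns the status together with whatever tesseract printed on stderr, which is usually the first place to look when the file is 0 bytes.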

Did I do anything wrong?
I downloaded tesseract 3.04 and leptonica 1.73 and compiled both.

The versions reported by tesseract -v are:

tesseract 3.04.00
 leptonica-1.73
  libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : zlib 1.2.3.4


Any help will be greatly appreciated.

Regards,
Ashish

Zdenko Podobný

Jun 6, 2016, 7:59:27 AM
to tesser...@googlegroups.com
Your leptonica build supports only a limited number of image formats. What kind of image are you trying to process?

Zdenko
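
One way to see which image libraries a build reports is to read the last line of the `tesseract -v` output, which lists them separated by ` : `. The following is a minimal Python sketch assuming that layout holds. The function name is hypothetical, and the sample text is the version report quoted in the first message.

```python
def linked_image_libraries(version_output):
    """Return the library names from the ' : '-separated line of `tesseract -v`."""
    libs = []
    for line in version_output.splitlines():
        if " : " not in line:
            continue  # skip the tesseract and leptonica version lines
        for entry in line.split(" : "):
            parts = entry.split()
            if parts:
                libs.append(parts[0])  # first token is the library name
    return libs


# Version report from the first message in this thread.
FIRST_REPORT = """tesseract 3.04.00
 leptonica-1.73
  libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : zlib 1.2.3.4
"""

print(linked_image_libraries(FIRST_REPORT))
print("libtiff" in linked_image_libraries(FIRST_REPORT))
```

If libtiff is not in the list, the build probably cannot write TIFF files, which may be why tessinput.tif comes out empty.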

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/f244961f-009c-40a7-8908-3e3bda490519%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

ashish goel

Jun 6, 2016, 12:01:29 PM
to tesser...@googlegroups.com
I am trying to process a PNG image. Will it work if I convert my PNG to TIFF before OCRing?

Ashish Goel

Jun 7, 2016, 6:54:36 AM
to tesseract-ocr
Hey Zdenko,
I also tried converting my image to tif/tiff, but it still did not give me a good tessinput.tif.
I found that libtiff was missing from my environment, so I installed libtiff4-dev and recompiled leptonica.

Now my version shows up as:

tesseract 3.04.00
 leptonica-1.73
  libjpeg 8b (libjpeg-turbo 1.2.0) : libpng 1.2.46 : libtiff 3.9.5 : zlib 1.2.3.4

but tessinput.tif is still blank.

Is there anything else that I can try so that I can get tessinput.tif?

Thanks
Ashish

Zdenko Podobný

Jun 7, 2016, 7:26:41 AM
to tesser...@googlegroups.com
What OS are you using?

Zdenko


Ashish Goel

Jun 7, 2016, 10:28:51 AM
to tesseract-ocr
Ubuntu 12.04



Zdenko Podobný

Jun 7, 2016, 10:37:23 AM
to tesser...@googlegroups.com
Is there a reason why you do not use the leptonica shipped by Ubuntu?
It is difficult to tell from your description where the problem is. If you run into problems with custom-compiled software, I think the best approach is to use the software packaged by your distribution.

Zdenko


Ashish Goel

Jun 7, 2016, 11:12:16 AM
to tesseract-ocr
Zdenko,

Thanks for your reply. I will try with the standard distro packages and let you know if it works.

Ashish

