I get the following error on a TIFF created from a PDF by ImageMagick:
tesseract file.tiff ocred -l eng
Tesseract Open Source OCR Engine with Leptonica
TIFFstream: Sorry, can not handle image.
Error in pixReadFromTiffStream: failed to read tiffdata
Error in pixReadStreamTiff: pix not read
Error in pixReadTiff: pix not read
TIFF created from a PDF with ImageMagick:
convert file.pdf -depth 4 file.tiff
TIFF information (truncated):
tiffinfo file.tiff
TIFF Directory at offset 0xb792a (751914)
Subfile Type: multi-page document (2 = 0x2)
Image Width: 595 Image Length: 842
Resolution: 72, 72 (unitless)
Bits/Sample: 4
Compression Scheme: None
Photometric Interpretation: RGB color
FillOrder: msb-to-lsb
Orientation: row 0 top, col 0 lhs
Samples/Pixel: 3
Rows/Strip: 9
Planar Configuration: single image plane
Page Number: 0-35
DocumentName: scanned.tiff
Software: ImageMagick 6.6.2-6 2010-12-02 Q16
http://www.imagemagick.org
TIFF Directory at offset 0x16f688 (1504904)
Subfile Type: multi-page document (2 = 0x2)
Image Width: 595 Image Length: 842
[...]
Tesseract 3.0 compiled from source (leptonica and libtiff installed)
on Ubuntu 2010.04.
Is there a problem with my TIFF?
Could someone point me to a TIFF file that is know to work with
Tesseract?
Thanks a lot!
Nicolas Raoul
ECM consultant in Tokyo