Hood Family

Jul 4, 2009, 2:38:15 AM
to tesser...@googlegroups.com
It says the following in the FAQ, and it seems Ray's been saying on the bug tracker since November that these things are fixed in 2.04, but they don't seem to be. Am I missing something?

"Without libtiff, Tesseract only reads uncompressed tiff files. Even then it won't read 32 bit tiff files correctly. Will be fixed in 2.04.
With libtiff, Tesseract reads compressed tiff files, but can't handle any color: 24 or 32 bit. It can only read 1 bit binary images or 8 bit greyscale. (No color maps!) Will be fixed in 2.04"
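To see which of the FAQ cases above a given file falls into, one option is to read the bit depth and compression straight from the TIFF header. Below is a minimal sketch in Python using only the standard library. The helper name `tiff_summary` is hypothetical (it is not part of Tesseract or libtiff), it inspects only the first image directory in the file, and the tag numbers are taken from the TIFF 6.0 specification.

```python
import struct

# Tag numbers from the TIFF 6.0 specification.
TAG_BITS_PER_SAMPLE = 258
TAG_COMPRESSION = 259
TAG_PHOTOMETRIC = 262
TAG_SAMPLES_PER_PIXEL = 277
TYPE_SHORT = 3  # 16-bit unsigned integer field type

WANTED = {TAG_BITS_PER_SAMPLE, TAG_COMPRESSION,
          TAG_PHOTOMETRIC, TAG_SAMPLES_PER_PIXEL}


def tiff_summary(data):
    """Hypothetical helper: describe the first image in a TIFF byte string."""
    if data[:2] == b"II":
        endian = "<"   # little-endian ("Intel") byte order
    elif data[:2] == b"MM":
        endian = ">"   # big-endian ("Motorola") byte order
    else:
        raise ValueError("not a TIFF file")
    magic, ifd_offset = struct.unpack_from(endian + "HI", data, 2)
    if magic != 42:
        raise ValueError("not a TIFF file")

    found = {}
    (entry_count,) = struct.unpack_from(endian + "H", data, ifd_offset)
    for i in range(entry_count):
        pos = ifd_offset + 2 + 12 * i  # each directory entry is 12 bytes
        tag, ftype, count = struct.unpack_from(endian + "HHI", data, pos)
        if tag not in WANTED or ftype != TYPE_SHORT:
            continue
        if count <= 2:
            start = pos + 8  # one or two SHORTs are stored inline
        else:
            # Longer arrays live elsewhere; the entry holds their offset.
            (start,) = struct.unpack_from(endian + "I", data, pos + 8)
        found[tag] = list(struct.unpack_from(endian + "%dH" % count, data, start))

    # BitsPerSample, Compression and SamplesPerPixel default to 1 when absent.
    bits = found.get(TAG_BITS_PER_SAMPLE, [1])
    return {
        "compressed": found.get(TAG_COMPRESSION, [1])[0] != 1,
        "bits_per_pixel": sum(bits),
        "samples_per_pixel": found.get(TAG_SAMPLES_PER_PIXEL, [1])[0],
        "palette": found.get(TAG_PHOTOMETRIC, [None])[0] == 3,
    }
```

Calling `tiff_summary(open("page.tif", "rb").read())` (the file name is only an example) returns a small dict. Going by the FAQ text, a file with `bits_per_pixel` of 1 or 8, one sample per pixel, and `palette` false is the kind that should be readable; anything with 24 or 32 bits per pixel is the colour case being asked about.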

I also saw a comment somewhere that support for formats other than tiff will be available in version 3 via leptonica, so I'm wondering what functionality leptonica support currently provides?

Thanks.

Ray Smith

Jul 6, 2009, 10:25:47 PM
to tesser...@googlegroups.com
Sorry about that misleading comment. I have improved the FAQ. The fix in 2.04 is that it works properly with libtiff, NOT that it reads more tiff files without it.

Leptonica itself likes to have (doesn't absolutely need) additional imaging libraries (tiff, jpg, png, gif) and then can read all those formats. This is going to make 3.00 more difficult to build/install, so I have been hesitant to make leptonica a necessary library. So far it still runs without it, but the page layout analysis is severely impaired without it, so leptonica will be the default.

Ray.

Yury Tarasievich

Jul 7, 2009, 2:16:44 AM
to tesser...@googlegroups.com
Ray Smith wrote:
...

> Leptonica itself likes to have (doesn't absolutely need) additional
> imaging libraries (tiff, jpg, png, gif) and then can read all those
> formats. This is going to make 3.00 more difficult to build/install, so
> I have been hesitant to make leptonica a necessary library. So far it
...

A notion of "self-sufficiency" for a package
of such complexity seems like a sort of
self-delusion, anyway. I've seen projects (one
project, anyway) with a beautiful idea at the
core, which ran crippled because of following
such a notion.

Nothing wrong with depending on such ubiquitous
and strongly specialised libraries as libtiff,
libpng, libjpeg.

--

disciple

Jul 9, 2009, 2:40:36 AM
to tesseract-ocr
OK, I'm still confused.
I'm guessing that first line should also be changed - to "will be
fixed in 3".
But I still find I can only use black and white files with libtiff in
2.04. Is the fix actually that tesseract produces an error message
instead of meaningless output?

On Jul 7, 2:25 pm, Ray Smith <theraysm...@gmail.com> wrote:
> Sorry about that misleading comment. I have improved the FAQ. The fix in
> 2.04 is that it works properly with libtiff, NOT that it reads more tiff
> files without it.
> Leptonica itself likes to have (doesn't absolutely need) additional imaging
> libraries (tiff, jpg, png, gif) and then can read all those formats. This is
> going to make 3.00 more difficult to build/install, so I have been hesitant
> to make leptonica a necessary library. So far it still runs without it, but
> the page layout analysis is severely impaired without it, so leptonica will
> be the default.
>
> Ray.
>

Ray Smith

Jul 10, 2009, 1:27:57 AM
to tesser...@googlegroups.com
Sorry, it is supposed to work, but it is not a well tested path, as I have been using leptonica for most of my file i/o for a while now.
I have made a note to investigate and will fix properly in 3.00.
Ray.