It says the following on the FAQ, and it seems Ray's
been saying on the bug tracker since
November that these things are fixed in 2.04,
but they
don't seem to be - am I missing something?
"Without libtiff, Tesseract only reads uncompressed tiff files. Even
then it won't read 32 bit tiff files correctly. Will be fixed in 2.04.
With libtiff, Tesseract reads compressed tiff files, but can't handle
any color: 24 or 32 bit. It can only read 1 bit binary images or 8 bit
greyscale. (No color maps!) Will be fixed in 2.04"
I also saw a comment
somewhere that support for formats other than tiff will be available in
version 3 via leptonica, so I'm wondering what functionality leptonica
support currently provides?
Sorry about that misleading comment. I have improved the FAQ. The fix in
2.04 is that it works properly with libtiff, NOT that it reads more tiff
files without it.
Leptonica itself likes to have (doesn't absolutely need) additional imaging
libraries (tiff, jpg, png, gif) and then can read all those formats. This is
going to make 3.00 more difficult to build/install, so I have been hesitant
to make leptonica a necessary library. So far it still runs without it, but
the page layout analysis is severely impaired without it, so leptonica will
be the default.
On Fri, Jul 3, 2009 at 11:38 PM, Hood Family <hoodb...@maxnet.co.nz> wrote:
> It says the following on the FAQ, and it seems Ray's been saying on the
> bug tracker since
> November that these things are fixed in 2.04, but they don't seem to be -
> am I missing something?
> "Without libtiff, Tesseract only reads uncompressed tiff files. Even then
> it won't read 32 bit tiff files correctly. Will be fixed in 2.04.
> With libtiff, Tesseract reads compressed tiff files, but can't handle any
> color: 24 or 32 bit. It can only read 1 bit binary images or 8 bit
> greyscale. (No color maps!) Will be fixed in 2.04"
> I also saw a comment somewhere that support for formats other than tiff
> will be available in version 3 via leptonica, so I'm wondering what
> functionality leptonica support currently provides?
> Leptonica itself likes to have (doesn't absolutely need) additional > imaging libraries (tiff, jpg, png, gif) and then can read all those > formats. This is going to make 3.00 more difficult to build/install, so > I have been hesitant to make leptonica a necessary library. So far it
...
A notion of "self-sufficiency" for the package of such complexity seems like a sort of self-delusion, anyway. I've seen projects (one project, anyway) with beautiful idea in the core, which ran crippled because of following such a notion.
Nothing wrong in depending on such ubiquitous and strongly specialised libraries as libtiff, libpng, libjpeg.
OK, I'm still confused.
I'm guessing that first line should also be changed - to "will be
fixed in 3".
But I still find I can only use black and white files with libtiff in
2.04. Is the fix actually that tesseract produces an error message
instead of meaningless output?
On Jul 7, 2:25 pm, Ray Smith <theraysm...@gmail.com> wrote:
> Sorry about that misleading comment. I have improved the FAQ. The fix in
> 2.04 is that it works properly with libtiff, NOT that it reads more tiff
> files without it.
> Leptonica itself likes to have (doesn't absolutely need) additional imaging
> libraries (tiff, jpg, png, gif) and then can read all those formats. This is
> going to make 3.00 more difficult to build/install, so I have been hesitant
> to make leptonica a necessary library. So far it still runs without it, but
> the page layout analysis is severely impaired without it, so leptonica will
> be the default.
> Ray.
> On Fri, Jul 3, 2009 at 11:38 PM, Hood Family <hoodb...@maxnet.co.nz> wrote:
> > It says the following on the FAQ, and it seems Ray's been saying on the
> > bug tracker since
> > November that these things are fixed in 2.04, but they don't seem to be -
> > am I missing something?
> > "Without libtiff, Tesseract only reads uncompressed tiff files. Even then
> > it won't read 32 bit tiff files correctly. Will be fixed in 2.04.
> > With libtiff, Tesseract reads compressed tiff files, but can't handle any
> > color: 24 or 32 bit. It can only read 1 bit binary images or 8 bit
> > greyscale. (No color maps!) Will be fixed in 2.04"
> > I also saw a comment somewhere that support for formats other than tiff
> > will be available in version 3 via leptonica, so I'm wondering what
> > functionality leptonica support currently provides?
Sorry, it is supposed to work, but it is not a well tested path, as I have
been using leptonica for most of my file i/o for a while now.I have made a
not to investigate and will fix properly in 3.00.
Ray.
On Wed, Jul 8, 2009 at 11:40 PM, disciple <hoodb...@maxnet.co.nz> wrote:
> OK, I'm still confused.
> I'm guessing that first line should also be changed - to "will be
> fixed in 3".
> But I still find I can only use black and white files with libtiff in
> 2.04. Is the fix actually that tesseract produces an error message
> instead of meaningless output?
> On Jul 7, 2:25 pm, Ray Smith <theraysm...@gmail.com> wrote:
> > Sorry about that misleading comment. I have improved the FAQ. The fix in
> > 2.04 is that it works properly with libtiff, NOT that it reads more tiff
> > files without it.
> > Leptonica itself likes to have (doesn't absolutely need) additional
> imaging
> > libraries (tiff, jpg, png, gif) and then can read all those formats. This
> is
> > going to make 3.00 more difficult to build/install, so I have been
> hesitant
> > to make leptonica a necessary library. So far it still runs without it,
> but
> > the page layout analysis is severely impaired without it, so leptonica
> will
> > be the default.
> > Ray.
> > On Fri, Jul 3, 2009 at 11:38 PM, Hood Family <hoodb...@maxnet.co.nz>
> wrote:
> > > It says the following on the FAQ, and it seems Ray's been saying on
> the
> > > bug tracker since
> > > November that these things are fixed in 2.04, but they don't seem to
> be -
> > > am I missing something?
> > > "Without libtiff, Tesseract only reads uncompressed tiff files. Even
> then
> > > it won't read 32 bit tiff files correctly. Will be fixed in 2.04.
> > > With libtiff, Tesseract reads compressed tiff files, but can't handle
> any
> > > color: 24 or 32 bit. It can only read 1 bit binary images or 8 bit
> > > greyscale. (No color maps!) Will be fixed in 2.04"
> > > I also saw a comment somewhere that support for formats other than tiff
> > > will be available in version 3 via leptonica, so I'm wondering what
> > > functionality leptonica support currently provides?