--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com.
To unsubscribe from this group, send email to tesseract-oc...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.
Just what I thought. Have you checked that “tesseract.exe” is in the same folder as the “tessdata” folder?
I could provide an exe built on a Win7 32-bit system, but I am not sure how, since I think Google blocks the *.exe extension. Is it enough to rename the extension?
Mike
Hi,
I just ran your tif file and got no results. It must have something to do with the size of the image: if I run a portion of the tiff smaller than 1000x1000, I get results.
Can somebody explain why a tif of this size (2480x3508 @ 8BPP) is not processed?
Mike
From: Richard Genthner [mailto:ric...@guthnur.net]
Sent: Friday, March 25, 2011 17:04
To: Lutz, Michael
Cc: tesser...@googlegroups.com
Subject: Re: tesseract.exe has stopped working on win2008 r2
Here is the screenshot and the tif file. Dmitri, if you rename the .exe that should work. I'm trying to get the training data up.
To get past Gmail's obtrusive security checks, you can zip the .exe
with password protection (give us the password). Or you can place the
.exe into two nested folders and change its extension, then zip and
send.
Your .tif looks okay, though.
Warm regards,
Dmitri Silaev
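The second suggestion above (nested folders plus a changed extension) could be scripted roughly as follows. This is only a sketch: the function name `pack_renamed`, the folder names `a` and `b`, and the `.executable` suffix are illustrative choices, and nothing here guarantees that a mail filter will accept the result. Python's standard `zipfile` module cannot write password-protected archives, so the password variant is not shown.

```python
import zipfile
from pathlib import Path


def pack_renamed(exe_path, zip_path, new_suffix=".executable", folders=("a", "b")):
    """Store exe_path inside zip_path under nested folders with a new extension.

    Folder names and suffix are hypothetical; pick whatever you like.
    Returns the name the file has inside the archive.
    """
    exe = Path(exe_path)
    # e.g. tesseract.exe -> a/b/tesseract.executable
    arcname = "/".join(folders) + "/" + exe.stem + new_suffix
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.write(exe, arcname)
    return arcname
```

The recipient unpacks the archive and renames the file back to `.exe`.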
On Sat, Mar 26, 2011 at 7:42 AM, zdenko podobny <zde...@gmail.com> wrote:
>> Can somebody explain why a tif size (2480x3508 @ 8BPP) is not processed?
The test image has 16 bpp.
> This is not a tesseract issue but a leptonica issue (library used for image handling).
> When I ran it on linux I got error messages coming from leptonica (1.67 -> I
> did not try 1.68 on linux yet):
> Error in pixReadFromTiffStream: spp not in set {1,3,4}
> Error in pixReadStreamTiff: pix not read
> Error in pixReadTiff: pix not read
I get the same warnings with Leptonica v1.68 on Windows XP SP3.
> On Windows the leptonica "release version" library does not show error/warning
> messages because of the compile option "NO_CONSOLE_IO"
> (see http://code.google.com/p/leptonica/issues/detail?id=42).
> It looks like leptonica does not support lzw compression for tiff
> (see http://www.leptonica.com/source/README.html "9. Image I/O" - lzw is
> mentioned in the png and gif sections, but not with tif). When I change the
> tif compression from lzw to zip (BTW: this also gives a smaller image),
> tesseract produces output (on XP SP3).
Incorrect. At least on Windows I build libtiff with "LZW_SUPPORT = 1"
in my nmake.opt file.
You can see the actual problem by looking at
http://tpgit.github.com/Leptonica/tiffio_8c_source.html#l00274, where
Leptonica gets the TIFFTAG_SAMPLESPERPIXEL. It allows 1, 3, or 4 but
not 2, which is what this image contains.
-- TP
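For anyone who wants to check a file for this before handing it to tesseract, here is a rough standard-library Python sketch that reads SamplesPerPixel straight out of the TIFF header. It is not Leptonica's code. The names `read_short_tags` and `describe` are made up for illustration, the tag numbers 258 and 277 are the ones from the TIFF 6.0 specification, and the accepted set {1, 3, 4} is copied from the error message quoted above. It only looks at the first IFD of a classic (non-BigTIFF) file.

```python
import struct

# Tag numbers from the TIFF 6.0 specification.
TAG_BITS_PER_SAMPLE = 258
TAG_SAMPLES_PER_PIXEL = 277

# Sample counts named in the "spp not in set {1,3,4}" error message.
ACCEPTED_SPP = {1, 3, 4}


def read_short_tags(data: bytes) -> dict:
    """Return {tag: first value} for SHORT-typed tags in the first IFD."""
    if data[:2] == b"II":
        endian = "<"  # little-endian file
    elif data[:2] == b"MM":
        endian = ">"  # big-endian file
    else:
        raise ValueError("missing TIFF byte-order mark")
    magic, ifd_offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF file")
    (entry_count,) = struct.unpack_from(endian + "H", data, ifd_offset)
    tags = {}
    for i in range(entry_count):
        entry = ifd_offset + 2 + 12 * i  # each IFD entry is 12 bytes
        tag, ftype, count = struct.unpack_from(endian + "HHI", data, entry)
        if ftype != 3:  # 3 = SHORT; other types are skipped in this sketch
            continue
        if count <= 2:
            value_pos = entry + 8  # values stored inline in the entry
        else:
            (value_pos,) = struct.unpack_from(endian + "I", data, entry + 8)
        (tags[tag],) = struct.unpack_from(endian + "H", data, value_pos)
    return tags


def describe(data: bytes) -> tuple:
    """Return (samples_per_pixel, bits_per_sample, spp_is_accepted)."""
    tags = read_short_tags(data)
    spp = tags.get(TAG_SAMPLES_PER_PIXEL, 1)  # TIFF default is 1
    bps = tags.get(TAG_BITS_PER_SAMPLE, 1)    # TIFF default is 1
    return spp, bps, spp in ACCEPTED_SPP
```

Calling `describe(open("scan.tif", "rb").read())` (with `scan.tif` standing in for your own file) on an image like the one in this thread should report 2 samples per pixel and flag it as not accepted.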
I used:
ACDSee and
AsTiffTagViewer
(http://www.awaresystems.be/imaging/tiff/astifftagviewer.html). The
results are a bit complicated, but I'm pretty sure this one tells you
EXACTLY what the tiff tags contain. "Whenever a customer reports your
software doesn't handle this or that particular TIFF, use
AsTiffTagViewer and discover why." :P It's the only tag viewer I know
of that correctly shows the ImageDescription tag for each page of a
multi-page tiff. Most image viewers seem to only show the first page's
ImageDescription.
And since you asked, I also ran libtiff 3.9.4's tiffinfo.exe to get:
TIFF Directory at offset 0x174e32 (1527346)
Image Width: 2480 Image Length: 3508
Resolution: 300, 300 (unitless)
Bits/Sample: 8
Compression Scheme: LZW
Photometric Interpretation: min-is-black
Extra Samples: 1<unassoc-alpha>
FillOrder: msb-to-lsb
Orientation: row 0 top, col 0 lhs
Samples/Pixel: 2
Rows/Strip: 1
Planar Configuration: single image plane
Page Number: 0-1
DocumentName:
C:/www/test/wwwdocs/files/ea1/ea1064bb1fdb449c28f97fa31b8e3ea6.tif
Predictor: horizontal differencing 2 (0x2)
Looks to me like some programs just look at Bits/Sample and forget to
also take into account Samples/Pixel?
-- TP
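The arithmetic behind that remark is bits per pixel = Bits/Sample x Samples/Pixel, so 8 x 2 = 16 for this file. A small Python sketch that pulls both numbers out of tiffinfo-style text follows. The function name `bits_per_pixel` is invented for this example, and treating a missing Samples/Pixel line as 1 is an assumption based on the TIFF default.

```python
import re


def bits_per_pixel(tiffinfo_text: str) -> int:
    """Multiply Bits/Sample by Samples/Pixel as printed by tiffinfo."""
    bits = re.search(r"Bits/Sample:\s*(\d+)", tiffinfo_text)
    samples = re.search(r"Samples/Pixel:\s*(\d+)", tiffinfo_text)
    if bits is None:
        raise ValueError("no Bits/Sample line found")
    # Assumption: no Samples/Pixel line means the TIFF default of 1.
    spp = int(samples.group(1)) if samples else 1
    return int(bits.group(1)) * spp


# The two relevant lines from the tiffinfo output above.
report = """\
Bits/Sample: 8
Samples/Pixel: 2
"""
print(bits_per_pixel(report))  # 8 * 2
```

A viewer that reports "8 BPP" for this file is most likely showing Bits/Sample alone.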
Hi All,
So the image Richard gave us is a compressed TIF file. Since tesseract only supports uncompressed TIF images, as noticed by Zdenko, you will not get any results from this image.
I attached the image as an uncompressed TIF file (see uncompressed.zip); tesseract processes this image without any problems.
Also attached is tesseract.zip, which should unpack to tesseract.executable; if it went through, just rename it to tesseract.exe. It is a static release build made with Win7 and WinSDK 7.1, if anyone still wants it.
Regards,
Mike
-----Original Message-----
From: Dmitri Silaev [mailto:daemo...@gmail.com]
Sent: Saturday, March 26, 2011 22:04
To: tesser...@googlegroups.com
Cc: zdenko podobny; Lutz, Michael; Richard Genthner
Support for uncompressed TIFFs only has not been an issue for a long time!
That was only the case during the period when Tess used home-brewed TIFF
input/output routines. Now Tesseract does support many TIFF variations
through the use of Leptonica.
Actually I don't use the image handling part of Tesseract, so I'm
more interested in investigating Tesseract's errors, not
Leptonica's.
Warm regards,
Dmitri Silaev
On Mon, Mar 28, 2011 at 4:41 PM, Lutz, Michael <ML...@nds.com> wrote:
> Sorry, you were not the one who said this. I mixed some things up when reading
> up on the issue this morning. This is what I was referring to:
>
> According to IrfanView, the image is an LZW-compressed tif file at 300 DPI. What
> Quan says is correct: the image is a heavily compressed tif. In my experience,
> Tesseract-OCR supports only uncompressed tif files.
>
> Sriranga(78yrsold)
>
> Thanks for pointing it out.
>
> Mike
> ---------- Forwarded message ----------
> From: "Sriranga(78yrsold)" <withbl...@gmail.com>
> To: "tesser...@googlegroups.com" <tesser...@googlegroups.com>
> Date: Sat, 26 Mar 2011 14:12:41 +0100
> Subject: Re: tesseract.exe has stopped working on win2008 r2
> According to IrfanView, the image is an LZW-compressed tif file at 300 DPI. What
> Quan says is correct: the image is a heavily compressed tif. In my experience,
> Tesseract-OCR supports only uncompressed tif files.
>
> On Sat, Mar 26, 2011 at 6:17 PM, Quan Nguyen <nguy...@gmail.com> wrote:
>>
>> The image appears to have been heavily compressed. Running OCR on the whole
>> image did not yield anything. Doing it blockwise, I got some results, but not
>> very accurate ones:
>>
>> Ch Juhe 24, 2@@9 the ACHP vctect ct: revisect teccmmehdettcns tcr
>> mee_s1es-muhqes-t'ube[[e (NR/H~
>> ‘evictetnce ct tmmuhity’ requtrementstcr heetthcete teefschheh‘. The
>> Heatthcate thtecttctn Ochtrct
>> Ptectices Aciviscry Ccmrmttee (HHCPAG) has ernctcfsed these changes.
>>
From: zdenko podobny [mailto:zde...@gmail.com]
Sent: Monday, March 28, 2011 14:34
To: Lutz, Michael