Tesseract 3.04 error.


Sriram Varadharajan

Sep 15, 2015, 11:40:57 PM
to tesseract-ocr
I installed Tesseract 3.04 by running

brew install tesseract --HEAD

After installation, tesseract --version prints

tesseract 3.04.00

 leptonica-1.71

  libjpeg 8d : libpng 1.6.17 : libtiff 4.0.3 : zlib 1.2.5




Now the problem: when I run a tesseract command such as

tesseract /Users/test/Documents/tmp/Identity1-77023.0.png output.txt

Tesseract Open Source OCR Engine v3.04.00 with Leptonica

Page 1

Warning in pixReadMemTiff: tiff page 1 not found

What does this warning mean? I am only passing a PNG image, not a page.



I searched the internet as best I could for a solution and got nowhere. I cannot go back to version 3.02 because it lacks hOCR support. I also tried compiling Tesseract 3.03 manually, without success.



If anyone could advise me on how to install version 3.01 of Tesseract with brew, please do so. I cannot seem to find the right command.


Thanks



ShreeDevi Kumar

Sep 16, 2015, 12:35:17 AM
to tesser...@googlegroups.com

Did you check whether the output file was created?

That is just a warning from leptonica.

- sent from my phone. excuse the brevity and typos.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/51f97baf-0793-46db-ab07-3a5cbaf20476%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

ShreeDevi Kumar

Sep 16, 2015, 12:36:47 AM
to tesser...@googlegroups.com

Does your input filename have a space in it?

- sent from my phone. excuse the brevity and typos.

Sriram Varadharajan

Sep 16, 2015, 12:35:16 PM
to tesseract-ocr
No, it does not have a space in it. You are correct that it is just a warning, and Tesseract does produce the output text file when I run it from the command line. The failure occurs when I invoke Tesseract as a UNIX process from my Java application, which reports:

Tesseract Errors: Page 1

Warning in pixReadMemTiff: tiff page 1 not found  


Rather than solving this problem, which could be because I am using version 3.03, I would like to downgrade to version 3.01, which supports the hOCR option. How do I install a packaged 3.01 version of Tesseract on a Mac? I don't want to go through the route of compiling it myself.

ShreeDevi Kumar

Sep 16, 2015, 12:46:49 PM
to tesser...@googlegroups.com, Quan Nguyen

If you are having trouble using it with Java, Quan may be able to suggest a solution.

- sent from my phone. excuse the brevity.

Sriram Varadharajan

Sep 16, 2015, 2:37:55 PM
to tesseract-ocr
I also tried to compile 3.01 after checking it out from SVN. When I build it on OS X Yosemite (10.10.5), I get the compilation errors referenced in the link below, which were apparently fixed in 3.02.


At this point I have no choice but to wait for Tesseract 3.3 updates that run on OS X.

This might be a naive question: is 3.3 still in development? Has it been tested on Mac OS X? When will there be a packaged release candidate? Any pointers would be great!

I appreciate all the help.

Thanks

zdenko podobny

Sep 17, 2015, 5:57:13 AM
to tesser...@googlegroups.com
First of all, if you need help, provide the original image for investigation.

Next, I do not understand why you are trying to compile old code (3.01) from SVN. It does not make sense: we switched to git and github.com, and there have been a lot of bug fixes for different platforms, including Mac. If you want to compile something from source, always use the latest stable version or the latest code from the master repository (but expect some problems).


Zdenko


Nick White

Sep 17, 2015, 6:15:58 AM
to tesser...@googlegroups.com
On Wed, Sep 16, 2015 at 10:16:40PM +0530, ShreeDevi Kumar wrote:
> If you are having trouble using it with Java, Quan maybe able to suggest a
> solution.

I agree, this sounds more like a Java issue to me. I don't know Java
at all, but if it's treating anything that sends output to stderr as
a failure, that should be something you can easily fix by changing the
behaviour of your Java code.

Certainly compiling an older version of Tesseract (which, as Zdenko
says, has significantly worse OS X support) is not the correct way
to go.

Nick
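
Nick's suggestion can be sketched in Java roughly as follows. This is a minimal illustration, not the original poster's code: the class name OcrProcessRunner and its Result type are hypothetical, it assumes Java 9 or later, and it assumes that tesseract signals real failures through a nonzero exit status while Leptonica warnings such as "tiff page 1 not found" only appear on stderr. The caller decides success from the exit code and keeps the stderr text for logging.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper: runs an external command and judges success by its
// exit code, so warnings written to stderr are not mistaken for errors.
public class OcrProcessRunner {

    /** Outcome of one external command: exit code plus captured stderr text. */
    public static final class Result {
        public final int exitCode;
        public final String stderr;

        Result(int exitCode, String stderr) {
            this.exitCode = exitCode;
            this.stderr = stderr;
        }

        /** Success depends on the exit code only, never on stderr content. */
        public boolean succeeded() {
            return exitCode == 0;
        }
    }

    public static Result run(List<String> command)
            throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        // Discard stdout so the child can never block on a full, unread pipe.
        builder.redirectOutput(ProcessBuilder.Redirect.DISCARD);
        Process process = builder.start();

        String stderr;
        try (InputStream err = process.getErrorStream()) {
            // Read stderr to the end; the stream closes when the child exits.
            stderr = new String(err.readAllBytes(), StandardCharsets.UTF_8);
        }
        int exitCode = process.waitFor();
        return new Result(exitCode, stderr);
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            System.out.println("usage: java OcrProcessRunner <command> [args...]");
            return;
        }
        Result result = run(List.of(args));
        if (result.succeeded()) {
            // Warnings, if any, are logged but do not fail the run.
            System.out.println("OK; stderr was: " + result.stderr.trim());
        } else {
            System.err.println("Failed with exit code " + result.exitCode
                    + ": " + result.stderr.trim());
        }
    }
}
```

Passing the tesseract command line from the first message as the arguments to main would then report success even when the pixReadMemTiff warning is printed, provided the process exits with status 0.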

S J

Aug 20, 2022, 2:20:56 AM
to tesseract-ocr
Hi sir,

I am facing this error. Please help.
Capture-2.PNG