Which revision of tesseract 3.0 for win7 64bit

115 views
Skip to first unread message

Joe Degenhardt

unread,
Aug 19, 2010, 6:23:29 AM8/19/10
to tesseract-ocr
Dear Sir or Madam,

I would like to know which revision of tesseract 3.0 is recommendable
to use under win7 64bit for OCR purposes at the moment? I have
recently tried several revisions: I compiled them with VS2008 in
release mode and tested the OCR functionality by running tesseract.exe
with the tif images attached to the source code. Without more ado
Rev300-326 were working, but all revisions beyond 350 either do not
compile or crash when running tesseract.exe. The latest revision in
the svn can be compiled and does not crash so far but stops with an
error message which has already been reported in issue 345.
I have read several forum posts and comments for several issues so far
and I got the impression the recent revisions might not work under
win7 64(or any kind of MS OS) without some kind of "hack" but I could
have missed some of the conception. I would like to know if this is
the current state or if someone is using one of the newer revisions
under a MS OS(especially win7 64) without such problems? Maybe I am
just missing some minor but crucial part.
What I would also like to know is if there is a big difference between
the revisions that are working for me (rev326) and the current one,
considering that I need OCR for a language consisting mostly of
english and a focus on a few(but not exclusivly those few) fonts?

Best Regards,
Joe Degenhardt

Jimmy O'Regan

unread,
Aug 19, 2010, 2:49:35 PM8/19/10
to tesser...@googlegroups.com
On 19 August 2010 11:23, Joe Degenhardt <joe.deg...@googlemail.com> wrote:
> Dear Sir or Madam,
>
> I would like to know which revision of tesseract 3.0 is recommendable
> to use under win7 64bit for OCR purposes at the moment? I have
> recently tried several revisions: I compiled them with VS2008 in
> release mode and tested the OCR functionality by running tesseract.exe
> with the tif images attached to the source code. Without more ado
> Rev300-326 were working, but all revisions beyond 350 either do not
> compile or crash when running tesseract.exe. The latest revision in
> the svn can be compiled and does not crash so far but stops with an
> error message which has already been reported in issue 345.
> I have read several forum posts and comments for several issues so far
> and I got the impression the recent revisions might not work under
> win7 64(or any kind of MS OS) without some kind of "hack" but I could
> have missed some of the conception. I would like to know if this is
> the current state or if someone is using one of the newer revisions
> under a MS OS(especially win7 64) without such problems? Maybe I am
> just missing some minor but crucial part.

No, that's the state of things.

> What I would also like to know is if there is a big difference between
> the revisions that are working for me (rev326) and the current one,
> considering that I need OCR for a language consisting mostly of
> english and a focus on a few(but not exclusivly those few) fonts?

Revision 326 is in the Tesseract 2 series.

--
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

Max

unread,
Aug 19, 2010, 5:45:53 PM8/19/10
to tesseract-ocr

On Aug 19, 11:49 am, "Jimmy O'Regan" <jore...@gmail.com> wrote:
> On 19 August 2010 11:23, Joe Degenhardt <joe.degenha...@googlemail.com> wrote:
>
> No, that's the state of things.
>

hmm... The latest code compiles and works for me :). May be I should
have mentioned that only the release mode is compilable and runnable
"out of the box", since leptonlibd.dll (debug version of leptonica) is
missing on svn.

>The latest revision in
>the svn can be compiled and does not crash so far but stops with an
>error message which has already been reported in issue 345.

To workaround issue 345 you can revert change #r448 .

max

zdenko podobny

unread,
Aug 23, 2010, 7:19:47 AM8/23/10
to tesser...@googlegroups.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com.
To unsubscribe from this group, send email to tesseract-oc...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.


There is new version (1.66) of leptonica [1] including precompiled windows libraries. I copied
lib/*.lib from leptonica-1.66-win32-lib-include-dirs.zip to tesseract\lib  and lib/*.dll to tesseract\ directory..

Than I was able to build Debug and Release version tesseract and these commands work :-):

tesseract.exe phototest.tif phototest
tesseract.exe phototest.tif phototest batch.nochop makebox
tesseract.exe phototest.tif phototest nobatch box.train

This is first time when I was able to compile tesseract on windows and it produce output :-). I have almost no experience with compiling sw on Windows so it would be great is somebody can check this or provide better process. 


BR,

Zd.

zdenko podobny

unread,
Aug 23, 2010, 8:57:16 AM8/23/10
to tesser...@googlegroups.com
Just remake: I compiled this way tesseract r454 in Visucal C++ 2008 Express Edition on Windows XP SP3.

Jimmy O'Regan

unread,
Aug 23, 2010, 9:21:15 AM8/23/10
to tesser...@googlegroups.com
On 23 August 2010 12:19, zdenko podobny <zde...@gmail.com> wrote:
>
> There is new version (1.66) of leptonica [1] including precompiled windows
> libraries. I copied
> lib/*.lib from leptonica-1.66-win32-lib-include-dirs.zip to tesseract\lib
>  and lib/*.dll to tesseract\ directory..
> Than I was able to build Debug and Release version tesseract and
> these commands work :-):
> tesseract.exe phototest.tif phototest
> tesseract.exe phototest.tif phototest batch.nochop makebox
> tesseract.exe phototest.tif phototest nobatch box.train
> This is first time when I was able to compile tesseract on windows and it
> produce output :-). I have almost no experience with compiling sw on Windows
> so it would be great is somebody can check this or provide better process.
> [1] http://code.google.com/p/leptonica/downloads/list

As it happens, I'm at a Windows box for the day, so I'll see about
updating the binaries later.

Quan Nguyen

unread,
Aug 23, 2010, 7:03:21 PM8/23/10
to tesseract-ocr
I am able to confirm Tesseract r454 with new Leptonica-1.66 binary ran
w/o the problem that was reported in Issue 304. Well, with one little
other problem, though:

Could not open file, ./tessdata/eng.user-words

I had to create an empty file with the name to get it to run. When I
tried with -l vie, it again put out another error:

Could not open file, ./tessdata/vie.user-words

The program should be able to continue w/o any *.user-words files.

Thanks.

zdenko podobny

unread,
Aug 26, 2010, 3:56:02 AM8/26/10
to tesser...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages