Packaging OCRopus
From: Étienne Bersac <bersac...@laposte.net>
Date: Fri, 29 Jun 2007 16:57:03 +0200
Subject: Packaging OCRopus
Hi all,

As you may know, I'm developing gnome-scan. I want to provide an OCR sink
plugin based on OCRopus by the end of August. As usual, when I depend on
software that is still in development, I build a deb package for it.

I got OCRopus up and running. This required a trivial patch to tesseract
SVN (see:
http://code.google.com/p/tesseract-ocr/issues/detail?id=36&can=2&q= ).
It would be nice to get it fixed soon. (Why not release it?)

Also, OCRopus uses Autotools + Jam. However, I don't see any way to get a
tarball. Of course, make distcheck is useless here, but jam does not
provide an equivalent :x. Also, OCRopus does not provide any tarball.

So, before packaging, I suggest distributing OCRopus itself. :) I'm
really looking forward to integrating it well into gnome-scan!

Regards,
Étienne.
--
Verso l'Alto !


From: Thomas Breuel <tmb...@gmail.com>
Date: Sat, 30 Jun 2007 10:13:02 -0700
Subject: Re: Packaging OCRopus
Hi,

Thanks for your feedback. Keep in mind that OCRopus is still "pre-alpha"
precisely so that we can get feedback on the build system and
architecture.

On Jun 29, 2007, at 7:57 AM, Étienne Bersac wrote:

> I got OCRopus up and running. This required a trivial patch to tesseract
> SVN (see:
> http://code.google.com/p/tesseract-ocr/issues/detail?id=36&can=2&q= ).
> It would be nice to get it fixed soon. (Why not release it?)

Ray Smith is the primary contact for Tesseract check-ins; could you  
ping him again, please?

> Also, OCRopus uses Autotools + Jam. However, I don't see any way to get a
> tarball. Of course, make distcheck is useless here, but jam does not
> provide an equivalent :x. Also, OCRopus does not provide any tarball.

Well, there are two choices.

First, we could add the necessary targets to the Jamfile.    What  
would be needed in the Jamfile for easy Debian packaging?

Second, while we don't like using make for development work, creating  
a separate automake-based build for the packaging should be pretty easy.

Which one would be better for you?  Which one could you help with?

Another area we haven't decided on yet is how to turn OCRopus into a
shared library. There's the obvious, simple way of doing it on
Linux, but providing a separate plain-C interface and exposing that
as the shared library interface might be better (since it permits
direct calls from FFIs and avoids Windows DLL issues related to C++).
Any suggestions/input?
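
A minimal sketch of the plain-C interface idea, for illustration only. The
Recognizer class and the ocr_* function names are hypothetical and are not
part of OCRopus; the point is the pattern: an opaque handle, extern "C"
entry points, and no C++ types or exceptions crossing the library boundary.

```cpp
#include <cstddef>
#include <cstring>
#include <stdexcept>
#include <string>

// Hypothetical C++ class standing in for an OCR engine (NOT the OCRopus API).
class Recognizer {
public:
    // Placeholder "recognition": just reports the image size as text.
    std::string recognize(const unsigned char* pixels, int width, int height) const {
        if (pixels == nullptr || width <= 0 || height <= 0)
            throw std::invalid_argument("bad image");
        return std::to_string(width) + "x" + std::to_string(height);
    }
};

// Opaque handle: C callers and FFIs only ever hold a pointer to it.
struct ocr_handle {
    Recognizer impl;
};

extern "C" {

// Returns a new handle, or NULL if allocation fails.
ocr_handle* ocr_create(void) {
    try { return new ocr_handle(); } catch (...) { return nullptr; }
}

// Releases a handle; passing NULL is harmless.
void ocr_destroy(ocr_handle* h) { delete h; }

// Writes a NUL-terminated result into out. Returns 0 on success and -1 on
// any error (bad arguments, buffer too small, or a C++ exception).
int ocr_recognize(ocr_handle* h, const unsigned char* pixels, int width, int height,
                  char* out, std::size_t out_size) {
    if (h == nullptr || out == nullptr || out_size == 0) return -1;
    try {
        const std::string text = h->impl.recognize(pixels, width, height);
        if (text.size() + 1 > out_size) return -1;  // no room for text plus NUL
        std::memcpy(out, text.c_str(), text.size() + 1);
        return 0;
    } catch (...) {
        return -1;  // exceptions must never propagate into C callers
    }
}

}  // extern "C"
```

A caller would pair ocr_create() with ocr_destroy() and check the return
code of ocr_recognize(). Because only C types appear in the signatures, the
same three symbols can be declared in a plain C header or bound from an FFI.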

Cheers,
Thomas.


From: Étienne Bersac <bersac...@laposte.net>
Date: Sat, 30 Jun 2007 19:18:36 +0200
Subject: Re: Packaging OCRopus
Hi,

I discussed this with my mentor. Two solutions came to mind: either fix
the Jam build so that it can generate a tarball, or migrate to automake.

I searched for Jam documentation, and the official web site was very hard
to find, for no gain compared to the well-documented automake. Also, I agree
that autoconf leads to a messy configure.ac; however, automake is quite good
and is complete (dist, distcheck, and friends).

I have started writing a full autotools build system for ocropus on top of
SVN. I will send the patch ASAP. Don't take it as an offense, but I find that
"make replacements" often overlook what automake provides, which leads to
this kind of manual coding. I don't mean Jam is the wrong solution at all;
it's just not yet a suitable replacement for autotools.

Expect some patches in the near future. :)

Regards,
Étienne.
--
Verso l'Alto !


From: "Bill Janssen" <bill.jans...@gmail.com>
Date: Sat, 30 Jun 2007 22:55:40 -0500
Subject: Re: Packaging OCRopus
Here's another vote for automake instead of Jam.

Bill

From: "Thomas Breuel" <tmb...@gmail.com>
Date: Mon, 2 Jul 2007 01:52:47 +0200
Subject: Re: Packaging OCRopus

I wrote the current automake configuration for Tesseract, so I'm familiar
with automake. Believe me, I don't like using unusual tools for building,
but automake just has too big a risk of producing incorrect output during
day-to-day development for a project like OCRopus.

Jam can be a pita and it's not very well documented.  OTOH, it's simple,
fast, mature, pretty widely used, and it usually does the right thing.  For
example, you can change into a subdirectory, type "jam", and it will update
the targets in that directory and anything they depend on, and it will do so
quickly and correctly.

Right now, we aren't planning on "migrating" to automake; what we can do is
try to keep an automake configuration in OCRopus for the benefit of packagers,
in addition to the regular jam-based builds, and see how that works.  This
is, incidentally, also how automake is used in Tesseract: its primary
developers use different build systems during development, and automake is
just used by packagers.

So, any automake configuration for OCRopus should be kept extremely simple.
It should only produce the top-level targets that are of relevance to
packagers and nothing else.  Please keep that in mind when trying to create
an automake configuration--don't try to reproduce all the Jamfile
functionality.

Cheers,
Thomas.

From: Étienne Bersac <bersac...@laposte.net>
Date: Mon, 02 Jul 2007 09:33:35 +0200
Subject: Re: Packaging OCRopus

Hi Thomas,

I have started the patch that adds building of the libraries and ocropus.
I have two issues:

First, ocropus uses e.g. #include "imgio.h" instead of #include
"../imgio/imgio.h". I don't understand why this works, or when it does not.

Second, I have a problem linking ocropus with tesseract. I found some
odd "PartialLinking" in the Jamfile that I don't understand.

Also, did you notice the bug report I filed for the tesseract + autoheader
bug?
http://code.google.com/p/tesseract-ocr/issues/detail?id=39&can=2&q=

Please help.

Regards,
Étienne.
--
Verso l'Alto !

Attachment: automake.diff (10K)

From: Ilya Mezhirov <mezhi...@googlemail.com>
Date: Tue, 03 Jul 2007 17:41:07 -0000
Subject: Re: Packaging OCRopus
Hi Étienne,

Thank you for the work!

> First, ocropus uses e.g. #include "imgio.h" instead of #include
> "../imgio/imgio.h". I don't understand why this works, or when it does not.

Yes. This works because ImportDir directives in Jamfiles provide
header paths both for Jam and gcc. ImportDirs have to be there anyway
to provide dependencies between directories, so they're used also for
the headers. There's a plan to use ImportDirs for the libraries, too.
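
To picture what "providing header paths" means on the compiler side, here is
a simplified sketch of how a quoted #include is looked up: first in the
directory of the including file, then in each extra search directory in
order (what -I does for gcc). The function name, the "ocropus" directory,
and the file set are made up for illustration; this models the lookup only
and is not how Jam or ImportDir is implemented.

```cpp
#include <set>
#include <string>
#include <vector>

// Simplified model of quoted-include lookup. Returns the first candidate
// path found in existing_files, or an empty string if none matches.
// Real compilers also consult further search lists and system directories.
std::string resolve_quoted_include(const std::string& name,
                                   const std::string& including_dir,
                                   const std::vector<std::string>& include_dirs,
                                   const std::set<std::string>& existing_files) {
    // 1. The directory of the file that contains the #include.
    std::vector<std::string> candidates;
    candidates.push_back(including_dir + "/" + name);
    // 2. Each extra search directory, in the order it was given.
    for (const std::string& dir : include_dirs)
        candidates.push_back(dir + "/" + name);

    for (const std::string& path : candidates)
        if (existing_files.count(path) != 0) return path;
    return std::string();  // not found: the compiler would report an error
}
```

With a source file in a hypothetical "ocropus" directory and the header in
"imgio", the lookup for "imgio.h" fails when no search directories are
given and succeeds once "imgio" is added to the list. That is the gap a
hand-written automake build has to close with its own include flags.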

> Second, I have a problem linking ocropus with tesseract. I found some
> odd "PartialLinking" in the Jamfile that I don't understand.

It's an old hack made to cope with the abundance of tesseract libraries.
It can be removed: just move all the -ltesseract_stuff into the top-level
Jamrules and delete all the stuff about tesseract_all.o, replacing
   LibraryFromObjects libtesseract.a : tesseract_all.o ;
with
   Library libtesseract : tesseract.cc ;
I'd do that but maybe it's better to simply merge 11 Tesseract
libraries into one.

I'll have a closer look at your patch and bug report tomorrow.

Again thank you and good luck with your project.

Best wishes,
Ilya


From: "Thomas Breuel" <tmb...@gmail.com>
Date: Fri, 6 Jul 2007 07:41:55 +0200
Subject: Re: Packaging OCRopus

> I'd do that but maybe it's better to simply merge 11 Tesseract
> libraries into one.

Well, having OCRopus build systems contain 11 Tesseract libraries just
doesn't make much sense.  Keep in mind that those 11 libraries not only need
to be listed by every software package using Tesseract, they also need to be
installed in /usr/lib.

Tesseract should be a single library, and the best thing to do is to change
the Tesseract build system to create a single library.

Until then, let's not change the OCRopus Jamfiles.  The automake stuff can
list the Tesseract libraries individually if it likes.

Cheers,
Tom

