Who can help me with this Dutch text?

31 views
Skip to first unread message

Pieter

unread,
Dec 24, 2009, 8:30:32 AM12/24/09
to tesseract-ocr
Hi all,

I just downloaded Tesseract (including the Dutch language files) to my
Ubuntu 9.10 installation from the repository. I use the terminal to
execute Tesseract.

The reason I downloaded Tesseract is that I would like to have digital
versions of a few books I own. I tested the program with a random text-
image (English) from internet, it worked like a charm. However, my
picture of a page of the book (edited with GIMP) results in rubbish:
http://www.redpanda.nl/img.tif (about 14MB).

Could somebody explain why it doesn't work in this case?
Thanks :)

Pieter

unread,
Dec 26, 2009, 7:37:17 AM12/26/09
to tesseract-ocr
Hmm I know what caused the bad quality, the picture wasn't in
grayscale yet... It works fine now!

Because there isn't much information available on Tesseract, I created
a little tutorial, http://redpanda.nl/Tesseract/ (in fact it's about
the preparation of your image before you use Tesseract).
Please let me know if you find it useful :)

On Dec 24, 2:30 pm, Pieter <pie...@redpanda.nl> wrote:
> Hi all,
>
> I just downloaded Tesseract (including the Dutch language files) to my
> Ubuntu 9.10 installation from the repository. I use the terminal to
> execute Tesseract.
>
> The reason I downloaded Tesseract is that I would like to have digital
> versions of a few books I own. I tested the program with a random text-
> image (English) from internet, it worked like a charm. However, my

> picture of a page of the book (edited with GIMP) results in rubbish:http://www.redpanda.nl/img.tif(about 14MB).

Reply all
Reply to author
Forward
0 new messages