Need general help

7 views
Skip to first unread message

spectrum

unread,
May 10, 2009, 11:48:59 PM5/10/09
to tesseract-ocr
Hi all. I'm running Ubuntu 8.04.

I have installed tesseract via the Ubuntu repositories, via synaptic
package manager.

Can someone please point me to a link where it teaches me how to
operate tesseract? I've tried typing tesseract into the command-line
(brought up via F2) but nothing happens.

I've also tried (from http://www.howtoforge.com/ocr_with_tesseract_on_ubuntu704)

tesseract %tiff_file% %name_for_resulting_files%

e.g.: tesseract document.tif result

With different file names.

Is there a specific location the files need to be in? I've followed
the steps previous from the howtoforge, I have uncompressed tifs.

I need really general and basic help, being quite new to Ubuntu. A
link would do.

Thanks!

paulfeakins

unread,
May 12, 2009, 6:06:42 AM5/12/09
to tesseract-ocr
Hi Spectrum,

Always good to hear more people are using Linux :)

First thing I'd suggest is to use an actual terminal window instead of
pressing F2. That way you'll be able to see what output you get and
whether there's been an error.

In Ubuntu (8.10) go to Applications > Accessories > Terminal. You
might have to type "sudo" before every command so that you have the
necessary permissions. i.e. "sudo tesseract..."

Have a good look through the wiki http://code.google.com/p/tesseract-ocr/w/list

Also, I seem to remember there's an issue with 2.03 where it needs a
patch before it will run on Ubuntu, so if it doesn't work, paste the
error message back to this thread and someone will point you in the
right direction!

Hope that helps,
Paul.

nguyenq

unread,
May 12, 2009, 7:36:04 PM5/12/09
to tesseract-ocr
You can use a user-friendly frontend for Tesseract on Linux.

http://vietocr.sf.net

Sherilyn Lim

unread,
May 12, 2009, 11:15:46 PM5/12/09
to tesser...@googlegroups.com
Thank you very much everyone!

It's working now :D

EarlWer

unread,
Jun 13, 2009, 12:32:29 AM6/13/09
to tesseract-ocr
I have a similar problem. Ubuntu 8.04 server, so no GUI ;-(
I want to convert incoming faxes received by hylafax in .tif format.

When I run tesseract: sudo tesseract fax000000083.tif ocrtest
it displays 'Tesseract Open Source OCR Engine', thinks for a 10
seconds and doesn't seem to do anything.

No logs, no ocrtest.txt nothing.

Any ideas?
Is there a debug mode? --verbose flag?





On May 12, 6:06 am, paulfeakins <paulfeak...@gmail.com> wrote:
> Hi Spectrum,
>
> Always good to hear more people are using Linux :)
>
> First thing I'd suggest is to use an actual terminal window instead of
> pressing F2. That way you'll be able to see what output you get and
> whether there's been an error.
>
> In Ubuntu (8.10) go to Applications > Accessories > Terminal. You
> might have to type "sudo" before every command so that you have the
> necessary permissions. i.e. "sudo tesseract..."
>
> Have a good look through the wikihttp://code.google.com/p/tesseract-ocr/w/list

EarlWer

unread,
Jun 13, 2009, 12:41:39 AM6/13/09
to tesseract-ocr
Let me update the program. I did an apt-get and got an older
version.

EarlWer

unread,
Jun 13, 2009, 2:06:49 AM6/13/09
to tesseract-ocr
Solved! I downloaded version 2.03 and now it works.
Now I've got hylafax receiving the faxes, converting to .pdf with an
OCR .txt file.

Next stop: Parse the data and file it...

Lukasz Szybalski

unread,
Jul 11, 2009, 1:01:39 PM7/11/09
to tesser...@googlegroups.com
On Sat, Jun 13, 2009 at 1:06 AM, EarlWer<ear...@gmail.com> wrote:
>
> Solved!  I downloaded version 2.03 and now it works.
> Now I've got hylafax receiving the faxes, converting to .pdf with an
> OCR .txt file.


Hello,

How is the quality of the converted text?
What are some of the commands that you are using? Could you copy and paste few.

Anything special that needs to be done between hylafax and tesseract?

Thanks,
Lucas
--
Using rsync. How to setup rsyncd.
http://lucasmanual.com/mywiki/rsync
DataHub - create a package that gets, parses, loads, visualizes data
http://lucasmanual.com/mywiki/DataHub
Reply all
Reply to author
Forward
0 new messages