How to run make training for a repo-installed Tesseract 3.03

sibi kanagaraj

Nov 5, 2014, 8:33:19 AM
to tesser...@googlegroups.com
Dear all ,

I am using Ubuntu 14.04 LTS on a 64-bit machine.

I installed Tesseract with: apt-get install tesseract-ocr

Running tesseract -v gives me the following:

tesseract 3.03
 leptonica-1.71
  libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8

Now I want to install the training module.

I tried the following link:
https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3

which says

"Beginning with 3.03, if you're compiling Tesseract from source you need to make and install the training tools with separate make commands. Once the above additional libraries have been installed, run the following from the tesseract source directory:"

I did not install it from source; I installed it from the repository. How and where should I run the following?

make training
sudo make training-install

Looking for Tesseract-related files, the only thing I can find is /usr/share/tesseract-ocr/tessdata.

Thank you
Sibi


ShreeDevi Kumar

Nov 5, 2014, 8:41:12 AM
to tesser...@googlegroups.com
Did you install the latest version from 

If so, it should have the training tools.

Try
which text2image
to see if it is installed.
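
A slightly broader version of that check can be sketched as below. This is only a minimal sketch, assuming a POSIX shell; check_tools is an illustrative helper name (not part of Tesseract), and the tool names passed to it are the ones mentioned in this thread.

```shell
#!/bin/sh
# check_tools is an illustrative helper, not a Tesseract command.
# It prints one line per tool and returns non-zero if any tool is missing.
check_tools() {
    missing=0
    for tool in "$@"; do
        if tool_path=$(command -v "$tool" 2>/dev/null); then
            printf '%s: %s\n' "$tool" "$tool_path"
        else
            printf '%s: not found\n' "$tool"
            missing=1
        fi
    done
    return "$missing"
}

# Training tool names taken from this thread.
check_tools text2image unicharset_extractor || echo "Some training tools are missing."
```

If both tools are reported with a path, the training tools are on the PATH; otherwise they still need to be built or installed.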


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Quan Nguyen

Nov 5, 2014, 8:48:03 AM
to tesser...@googlegroups.com
I read that Ubuntu 14.10 has the Tesseract training executables.

sibi kanagaraj

Nov 5, 2014, 9:09:09 AM
to tesser...@googlegroups.com
Dear Shree and Quan,

Thank you for the replies. I solved the issue by following Shree's direction.

Here is what I did:

1. Removed any previous installations using
sudo apt-get remove tesseract-ocr
sudo apt-get remove --auto-remove tesseract-ocr

2. Downloaded [tesseract_3.03.03.orig.tar.gz] from http://packages.ubuntu.com/utopic/tesseract-ocr

3. Extracted it to /home/sibi/tesseract-3.03

4. Followed the normal procedure of

./autogen.sh
./configure
make
sudo make install
sudo ldconfig

5. Then used

make training
sudo make training-install

6. To confirm that the training module is installed, checked that these commands are available:
unicharset_extractor
text2image

Thank you once again, Shree and Quan.
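
For anyone repeating this, the build and install commands from steps 4 and 5 can be collected into one script. The following is only a sketch, assuming a POSIX shell and an already extracted source directory; DRY_RUN, run and build_with_training are illustrative names, not part of Tesseract. By default it only prints the commands instead of running them.

```shell
#!/bin/sh
# DRY_RUN, run and build_with_training are illustrative names, not part of Tesseract.
# With DRY_RUN=1 (the default here) each command is printed instead of executed.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

# Run the steps from this thread in order, stopping at the first failure.
build_with_training() {
    src_dir=$1
    run cd "$src_dir" &&
    run ./autogen.sh &&
    run ./configure &&
    run make &&
    run sudo make install &&
    run sudo ldconfig &&
    run make training &&
    run sudo make training-install
}

# The first argument is the source directory; defaults to the current directory.
build_with_training "${1:-.}"
```

Setting DRY_RUN=0 in the environment makes the script execute the commands for real, for example with /home/sibi/tesseract-3.03 as the argument.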

-Sibi

sibi kanagaraj

Nov 5, 2014, 12:25:00 PM
to tesser...@googlegroups.com
However, I am now getting this error for the first time when I try to list the languages using tesseract --list-langs:

//Error opening data file /usr/local/share/tessdata/eng.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory.
Failed loading language 'eng'
Tesseract couldn't load any languages!
Could not initialize tesseract.
//

-Sibi

sibi kanagaraj

Nov 5, 2014, 12:31:27 PM
to tesser...@googlegroups.com
Sorry to have asked that question. I sorted it out by setting:

export TESSDATA_PREFIX=/home/sibi/tesseract-3.03/tessdata.
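
The error message asks for the parent directory of the "tessdata" directory, so one way to see which value fits is to look for tessdata/eng.traineddata under each candidate parent. This is only a sketch, assuming a POSIX shell; find_tessdata_prefix is a hypothetical helper (not a Tesseract command), and the candidate directories are the parents of the tessdata paths mentioned in this thread.

```shell
#!/bin/sh
# find_tessdata_prefix is a hypothetical helper, not a Tesseract command.
# It prints the first candidate directory containing tessdata/<lang>.traineddata
# and returns non-zero if none of the candidates does.
find_tessdata_prefix() {
    lang=$1
    shift
    for dir in "$@"; do
        if [ -f "$dir/tessdata/$lang.traineddata" ]; then
            printf '%s\n' "$dir"
            return 0
        fi
    done
    return 1
}

# Candidate parents of the tessdata directories mentioned in this thread.
if prefix=$(find_tessdata_prefix eng /usr/local/share /usr/share/tesseract-ocr /home/sibi/tesseract-3.03); then
    echo "export TESSDATA_PREFIX=$prefix"
else
    echo "eng.traineddata not found under any candidate"
fi
```

The script only prints a suggested export line; it does not change the environment itself.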

-Sibi