Difference between sudo apt-get install tesseract and installing from source

84 views
Skip to first unread message

sibi kanagaraj

unread,
Oct 12, 2014, 1:19:17 AM10/12/14
to tesser...@googlegroups.com
Hello all ,

What is the difference between sudo apt-get install tesseract and installing from source  .
Let me explain the context so that it could give a clear picture .

I wanted to work with Tesseract and build better Training Data for Tamil language as well as see if any modifications could be done to engine also to make it more responsive for Tamil .
1.Hence installed Tesseract via sudo apt-get install tesseract .
2.I could see that the files and folders relating to tessdata were created in /usr/share/tesseract-ocr
3.And installed all the necessary packages as mentioned here
https://code.google.com/p/tesseract-ocr/wiki/Compiling
4.Now the command tesseract is working fine .
5.I wanted to see the debugging of the system , hence posted a question here
https://groups.google.com/forum/#!topic/tesseract-ocr/BxZvoUlxjOc
6.I was asked to perform the make file operations in Java folder .
7.Since within the /usr/share I could not find java folder , I created a folder and tried to do the make .But it did not work out .
8.As per zdenop reply in the above , I am trying to configure it from the source and hence created a clone using
9.Here I could find the /java folder .

My questions are
1.What is the difference between package installation and installing from source ?
2.Where can I find the files if I am doing a package installation ?
3.Am I doing the entire process wrong ?


Rick Leir

unread,
Oct 17, 2014, 9:26:41 AM10/17/14
to tesser...@googlegroups.com
You probably got the source for a different version of Tesseract. This might not matter, depending on what you are doing. Find out the version by running it: you will see 'Tesseract Open Source OCR Engine v3.04.00 with Leptonica' or similar.

How to train:
http://blog.cedric.ws/how-to-train-tesseract-301

Cedric's server is not responding so try:
http://webcache.googleusercontent.com/search?q=cache:RDSrO-_YQLgJ:blog.cedric.ws/how-to-train-tesseract-301+&cd=1&hl=en&ct=clnk&gl=ca&client=ubuntu
Reply all
Reply to author
Forward
0 new messages