Re: [tesseract-ocr] Training tools don't get built when building tesseract from source


Shree Devi Kumar

Aug 12, 2018, 11:51:57 PM
to tesser...@googlegroups.com
sudo apt-get remove tesseract-ocr
sudo apt-get remove libtesseract-dev

sudo add-apt-repository ppa:alex-p/tesseract-ocr
sudo apt-get update

sudo apt install tesseract-ocr
sudo apt install libtesseract-dev  

The above will uninstall any existing packages and then install the latest binaries from the PPA.

Why are you then building from source again?

The PPA also installs the training scripts and puts them on the PATH.
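
One way to confirm this is to ask the shell where each tool resolves. The sketch below is only an illustration: `check_tools` is a hypothetical helper name, the binary names are the ones that appear in the install log later in this thread, and `tesstrain.sh` is an assumption about which training script is meant.

```shell
#!/bin/sh
# check_tools is a hypothetical helper, not part of Tesseract.
# For each name given, print where it resolves on PATH, or flag it as missing.
# The body runs in a subshell "( ... )" so its variables stay local.
check_tools() (
    for tool in "$@"; do
        if path=$(command -v "$tool" 2>/dev/null); then
            printf '%s -> %s\n' "$tool" "$path"
        else
            printf '%s -> NOT FOUND\n' "$tool"
        fi
    done
)

# Names below are examples; tesstrain.sh is an assumed script name.
check_tools lstmtraining text2image combine_tessdata tesstrain.sh
```

Anything reported as NOT FOUND is either not installed or installed in a directory that is not on the PATH.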

On Mon, Aug 13, 2018 at 3:24 AM Shandigutt <strid...@gmail.com> wrote:
Hi,

  • I was working with a previous version of Tesseract and was asked to get the latest snapshot when I came across the error below.

  • I followed the wiki and the video tutorials below to build Tesseract from source.

  • I didn't build Leptonica, as I had already built and installed version 1.74.4.

  • I ran exactly the commands below. I have attached the output of some of them.

sudo apt-get remove tesseract-ocr
sudo apt-get remove libtesseract-dev

sudo add-apt-repository ppa:alex-p/tesseract-ocr
sudo apt-get update

sudo apt install tesseract-ocr
sudo apt install libtesseract-dev

sudo apt-get install g++ # or clang++ (presumably)
sudo apt-get install autoconf automake libtool
sudo apt-get install pkg-config
sudo apt-get install libpng-dev
sudo apt-get install libjpeg8-dev
sudo apt-get install libtiff5-dev
sudo apt-get install zlib1g-dev

sudo apt-get install libicu-dev
sudo apt-get install libpango1.0-dev
sudo apt-get install libcairo2-dev

git clone https://github.com/tesseract-ocr/tesseract.git     #output: git_clone.txt
cd tesseract
autoreconf -vi   #output: autoreconf_vi.txt
./autogen.sh     #output: autogen_sh.txt
./configure --enable-debug      #output: configure_enable_debug.txt
LDFLAGS="-L/usr/local/lib" CFLAGS="-I/usr/local/include" make       #output: make.txt
sudo make install       #output: make_install.txt
sudo ldconfig        #output: ldconfig.txt
make training        #output: make_training.txt
sudo make training-install       #output: make_training_install.txt


  • My OS details are below:
Linux shandigutt-laptop-ubuntu 4.4.0-130-generic #156-Ubuntu SMP Thu Jun 14 08:53:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

After building Tesseract with the steps above, querying the Tesseract version gives the following output:
tesseract -v
tesseract 4.0.0-beta.4-26-gfd49
 leptonica-1.74.4
  libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

 Found SSE


  • The file tree of the tesseract directory after building is attached as tesseract_tree.txt.

I can't find all the training scripts I had previously. I would appreciate your help with this.

Thanks,




Shree Devi Kumar

Aug 15, 2018, 12:06:32 AM
to tesser...@googlegroups.com
libtool: install: /usr/bin/install -c .libs/combine_lang_model /usr/local/bin/combine_lang_model
libtool: install: /usr/bin/install -c .libs/combine_tessdata /usr/local/bin/combine_tessdata
libtool: install: /usr/bin/install -c .libs/dawg2wordlist /usr/local/bin/dawg2wordlist
libtool: install: /usr/bin/install -c .libs/lstmeval /usr/local/bin/lstmeval
libtool: install: /usr/bin/install -c .libs/lstmtraining /usr/local/bin/lstmtraining
libtool: install: /usr/bin/install -c .libs/merge_unicharsets /usr/local/bin/merge_unicharsets
libtool: install: /usr/bin/install -c .libs/set_unicharset_properties /usr/local/bin/set_unicharset_properties
libtool: install: /usr/bin/install -c .libs/text2image /usr/local/bin/text2image
libtool: install: /usr/bin/install -c .libs/unicharset_extractor /usr/local/bin/unicharset_extractor
libtool: install: /usr/bin/install -c .libs/wordlist2dawg /usr/local/bin/wordlist2dawg
libtool: install: /usr/bin/install -c .libs/ambiguous_words /usr/local/bin/ambiguous_words
libtool: install: /usr/bin/install -c .libs/classifier_tester /usr/local/bin/classifier_tester
libtool: install: /usr/bin/install -c .libs/cntraining /usr/local/bin/cntraining
libtool: install: /usr/bin/install -c .libs/mftraining /usr/local/bin/mftraining
libtool: install: /usr/bin/install -c .libs/shapeclustering /usr/local/bin/shapeclustering

As the install log above shows, the training tools are installed in /usr/local/bin.
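
Since Debian/Ubuntu packages such as the PPA ones install under /usr/bin while the source build installs under /usr/local/bin, two copies of the same tool can coexist. A minimal sketch for listing every copy on the PATH, in lookup order, is below. `where_all` is a hypothetical helper name, and the two tool names are just examples taken from this thread.

```shell
#!/bin/sh
# where_all is a hypothetical helper, not part of Tesseract.
# It prints every executable named $1 found on PATH, in lookup order;
# the first line printed is the copy the shell would run.
# The body runs in a subshell "( ... )" so IFS and globbing changes stay local.
where_all() (
    name=$1
    set -f        # do not glob-expand PATH entries
    IFS=:         # split PATH on colons
    for dir in $PATH; do
        [ -n "$dir" ] || dir=.    # an empty PATH entry means the current directory
        if [ -f "$dir/$name" ] && [ -x "$dir/$name" ]; then
            printf '%s\n' "$dir/$name"
        fi
    done
)

where_all tesseract
where_all lstmtraining
```

If a tool shows up in both directories, the one listed first is the one that runs, which may not be the build you just installed.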

Shree Devi Kumar

Aug 15, 2018, 12:08:46 AM
to tesser...@googlegroups.com
│   │   ├── training
│   │   │   ├── ambiguous_words
│   │   │   ├── ambiguous_words.o
│   │   │   ├── boxchar.lo
│   │   │   ├── boxchar.o
│   │   │   ├── classifier_tester
│   │   │   ├── classifier_tester.o
│   │   │   ├── cntraining
│   │   │   ├── cntraining.o
│   │   │   ├── combine_lang_model
│   │   │   ├── combine_lang_model.o
│   │   │   ├── combine_tessdata
│   │   │   ├── combine_tessdata.o
│   │   │   ├── commandlineflags.lo
│   │   │   ├── commandlineflags.o
│   │   │   ├── commontraining.lo
│   │   │   ├── commontraining.o
│   │   │   ├── dawg2wordlist
│   │   │   ├── dawg2wordlist.o
│   │   │   ├── degradeimage.lo
│   │   │   ├── degradeimage.o
│   │   │   ├── fileio.lo
│   │   │   ├── fileio.o
│   │   │   ├── lang_model_helpers.lo
│   │   │   ├── lang_model_helpers.o
│   │   │   ├── libtesseract_tessopt.la
│   │   │   ├── libtesseract_training.la
│   │   │   ├── ligature_table.lo
│   │   │   ├── ligature_table.o
│   │   │   ├── lstmeval
│   │   │   ├── lstmeval.o
│   │   │   ├── lstmtester.lo
│   │   │   ├── lstmtester.o
│   │   │   ├── lstmtraining
│   │   │   ├── lstmtraining.o
│   │   │   ├── Makefile
│   │   │   ├── mergenf.o
│   │   │   ├── merge_unicharsets
│   │   │   ├── merge_unicharsets.o
│   │   │   ├── mftraining
│   │   │   ├── mftraining.o
│   │   │   ├── normstrngs.lo
│   │   │   ├── normstrngs.o
│   │   │   ├── pango_font_info.lo
│   │   │   ├── pango_font_info.o
│   │   │   ├── set_unicharset_properties
│   │   │   ├── set_unicharset_properties.o
│   │   │   ├── shapeclustering
│   │   │   ├── shapeclustering.o
│   │   │   ├── stringrenderer.lo
│   │   │   ├── stringrenderer.o
│   │   │   ├── tessopt.lo
│   │   │   ├── tessopt.o
│   │   │   ├── text2image
│   │   │   ├── text2image.o
│   │   │   ├── tlog.lo
│   │   │   ├── tlog.o
│   │   │   ├── unicharset_extractor
│   │   │   ├── unicharset_extractor.o
│   │   │   ├── unicharset_training_utils.lo
│   │   │   ├── unicharset_training_utils.o
│   │   │   ├── validate_grapheme.lo
│   │   │   ├── validate_grapheme.o
│   │   │   ├── validate_indic.lo
│   │   │   ├── validate_indic.o
│   │   │   ├── validate_javanese.lo
│   │   │   ├── validate_javanese.o
│   │   │   ├── validate_khmer.lo
│   │   │   ├── validate_khmer.o
│   │   │   ├── validate_myanmar.lo
│   │   │   ├── validate_myanmar.o
│   │   │   ├── validator.lo
│   │   │   ├── validator.o
│   │   │   ├── wordlist2dawg
│   │   │   └── wordlist2dawg.o


Shandigutt

Sep 1, 2018, 5:55:50 PM
to tesseract-ocr
Thanks a lot, Shree.

It's already installed.