combine_tessdata not found on Ubuntu 14.04

280 views
Skip to first unread message

Traun Leyden

unread,
Jul 20, 2014, 12:25:12 AM7/20/14
to tesser...@googlegroups.com

According to http://manpages.ubuntu.com/manpages/trusty/man1/combine_tessdata.1.html, combine_tessdata should be installed on Ubuntu 14.04, but I'm not seeing it anywhere.

# dpkg -L tesseract-ocr
/.
/usr
/usr/share
/usr/share/man
/usr/share/man/man1
/usr/share/man/man1/cntraining.1.gz
/usr/share/man/man1/tesseract.1.gz
/usr/share/man/man1/shapeclustering.1.gz
/usr/share/man/man1/combine_tessdata.1.gz
/usr/share/man/man1/unicharset_extractor.1.gz
/usr/share/man/man1/ambiguous_words.1.gz
/usr/share/man/man1/wordlist2dawg.1.gz
/usr/share/man/man1/mftraining.1.gz
/usr/share/man/man1/dawg2wordlist.1.gz
/usr/share/doc
/usr/share/doc/tesseract-ocr
/usr/share/doc/tesseract-ocr/copyright
/usr/bin
/usr/bin/tesseract
/usr/share/doc/tesseract-ocr/changelog.Debian.gz

It seems like the ubuntu package only installs the /usr/bin/tesseract binary and the man page for combine_tessdata.1.gz

To install combine_tessdata, will I need to build from source?

Traun Leyden

unread,
Jul 20, 2014, 2:25:46 AM7/20/14
to tesser...@googlegroups.com

I was able to build from source using the following Dockerfile:



In that docker container, all the tools such as combine_tessdata, wordlist2dawg, dawg2wordlist, etc seem to be working.

Steve Capell

unread,
Jul 20, 2014, 2:30:43 AM7/20/14
to tesser...@googlegroups.com
I upgraded from Ubuntu 12.04 to 14.04 yesterday and I am seeing the same thing.
Reply all
Reply to author
Forward
0 new messages