unicharset_extractor: command not found


73483...@stevenma.xyz

Apr 11, 2016, 3:14:16 AM
to tesseract-ocr
I installed Tesseract through MacPorts and got this error when trying to run unicharset_extractor. Full question at: http://stackoverflow.com/questions/36536490/tesseract-unicharset-extractor-command-not-found

ShreeDevi Kumar

Apr 11, 2016, 4:31:00 AM
to tesser...@googlegroups.com
You need to build the training tools in order to run unicharset_extractor:

make training
make training-install

You will get 11 more executables in addition to tesseract. On Windows I have the following:

ambiguous_words.exe
classifier_tester.exe
cntraining.exe
combine_tessdata.exe
dawg2wordlist.exe
mftraining.exe
set_unicharset_properties.exe
shapeclustering.exe
text2image.exe
unicharset_extractor.exe
wordlist2dawg.exe
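
To see which of these are actually on your PATH, a small check along these lines may help. This is only a sketch: the function name missing_training_tools is made up for illustration, and the tool names are the ones listed above without the Windows .exe suffix.

```shell
#!/bin/sh
# Training tools named in this thread (without the Windows .exe suffix).
TRAINING_TOOLS="ambiguous_words classifier_tester cntraining combine_tessdata dawg2wordlist mftraining set_unicharset_properties shapeclustering text2image unicharset_extractor wordlist2dawg"

# Print each tool that cannot be found on PATH, one per line.
# (Hypothetical helper, not part of Tesseract.)
missing_training_tools() {
    for tool in $TRAINING_TOOLS; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool"
        fi
    done
}

missing=$(missing_training_tools)
if [ -z "$missing" ]; then
    echo "all training tools found"
else
    echo "missing training tools:"
    echo "$missing"
fi
```

If unicharset_extractor shows up as missing, the package you installed most likely does not ship the training tools.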

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


ShreeDevi Kumar

Apr 11, 2016, 5:02:39 AM
to tesser...@googlegroups.com

Looks like MacPorts does not include the training tools.

ShreeDevi

Tom Morris

Apr 12, 2016, 12:32:22 PM
to tesseract-ocr
On Monday, April 11, 2016 at 5:02:39 AM UTC-4, shree wrote:

Looks like MacPorts does not include the training tools.

I'm not sure the training tools should necessarily be included in the standard package.

It looks like Homebrew has the option to install the training tools by using a --with-training-tools switch, so I created an enhancement request for the same functionality in MacPorts: https://trac.macports.org/ticket/51127

For the OP, you'll need to do one of the following:

1.
$ port uninstall tesseract
$ brew install tesseract --with-training-tools

2.
$ git clone g...@github.com:tesseract-ocr/tesseract.git
$ cd tesseract
$ ./autogen.sh
$ ./configure
$ make
$ make install
$ make training
$ make training-install


Tom

Børre Ludvigsen

May 20, 2016, 4:32:37 AM
to tesseract-ocr
I downloaded tesseract-master/ from GitHub, ran ./autogen.sh, then ran ./configure, which exited with "configure: error: leptonica library with pdf support (>= 1.71) is missing".

What should I do? (I have a working MacPorts version of tesseract installed and also a working version of jTessBoxEditor, but I need the training tools.)

- Thanks
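
One thing worth checking for an error like this is where the Leptonica headers actually live, since configure may simply not be looking under the MacPorts prefix. The sketch below assumes the header is installed as include/leptonica/allheaders.h and that MacPorts uses its default /opt/local prefix; the function name find_leptonica_prefix is hypothetical.

```shell
#!/bin/sh
# Print the first prefix under which the Leptonica header allheaders.h exists.
# (Hypothetical helper; assumes the usual <prefix>/include/leptonica layout.)
find_leptonica_prefix() {
    for prefix in "$@"; do
        if [ -f "$prefix/include/leptonica/allheaders.h" ]; then
            echo "$prefix"
            return 0
        fi
    done
    return 1
}

# /opt/local is the default MacPorts prefix; /usr/local and /usr are other
# common install locations.
if prefix=$(find_leptonica_prefix /opt/local /usr/local /usr); then
    echo "Leptonica headers found under $prefix"
else
    echo "Leptonica headers not found in the prefixes checked"
fi
```

If the headers are found under a prefix that configure does not search by default, that prefix probably has to be passed to configure; if they are not found at all, Leptonica (1.71 or later, per the error message) needs to be installed first.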