Building training tools from source

506 views
Skip to first unread message

Peter Hamberg

unread,
Aug 1, 2014, 5:06:50 AM8/1/14
to tesser...@googlegroups.com
Hi,
I'm trying to learn how to use tesseract, but I need some help, I think. I'm currently stuck on the training tesseract part, because for some reason I cant make the training tools.

I'm on a Ubuntu 14.04 machine, and I've followed the instructions on https://code.google.com/p/tesseract-ocr/wiki/Compiling - added the libraries, leptonica, 
config, make, make install, all that seems to work without any error messages.

But when I try
make training


all i get is the message "make: Nothing to be done for `training'."
Am I in the wrong folder? What have I missed here?

// Peter

Paul

unread,
Aug 1, 2014, 5:44:08 AM8/1/14
to tesser...@googlegroups.com
Did you see this thread?


Although your problem seems different...

zdenko podobny

unread,
Aug 1, 2014, 7:00:33 AM8/1/14
to tesser...@googlegroups.com

On Fri, Aug 1, 2014 at 11:06 AM, Peter Hamberg <zazca...@gmail.com> wrote:
Hi,
This is a secure message chain, protected by Virtru.

I'm on a Ubuntu 14.04 machine, and I've followed the instructions on https://code.google.com/p/tesseract-ocr/wiki/Compiling

and there is written: "If you want the training tools (3.03)..."
It means that that instruction are valid for 3.03 version and you are compiling 3.02.02 version (where training tools are built automatically).
 
- added the libraries, leptonica, 
config, make, make install, all that seems to work without any error messages.

But when I try

make training


all i get is the message "make: Nothing to be done for `training'."
Am I in the wrong folder? What have I missed here?

// Peter

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ac1705f0-ed9e-408e-82a0-b835e4e8ad51%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Peter Hamberg

unread,
Aug 1, 2014, 8:17:23 AM8/1/14
to tesser...@googlegroups.com
I knew it had to be something obvious. I didn't realise that the version of the source code available from the download page wasnt the latest version. Thanks for the clarification. Switching to the newer version worked.

Shree Devi Kumar

unread,
Aug 1, 2014, 8:47:50 AM8/1/14
to tesser...@googlegroups.com
for training tools on ubuntu 14.04 

Jeff Breidenbach
Jul 31 (21 hours ago)
 

Re: [tesseract-dev] Re: Training tools linking failure, icu_48::* 

Here's a way to compile Tesseract on Ubuntu, while still 
using the packaging system. It will get you the training tools. 
 

sudo apt-get build-dep tesseract-ocr 
sudo apt-get install devscripts 
dget http://http.debian.net/debian/pool/main/t/tesseract/tesseract_3.03.03-1.dsc 
tar xvzf tesseract_3.03.03.orig.tar.gz 
cd tesseract-3.03 
zcat ../tesseract_3.03.03-1.diff.gz | patch -p1 
debuild -us -uc 
cd .. 
sudo dpkg -i *.deb 


Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Dovhani Foneworx

unread,
Aug 11, 2014, 9:17:45 AM8/11/14
to tesser...@googlegroups.com
Hi, when I do "make training" the following error showing that is a bug in the code comes up.


normstrngs.cpp:27:49: error: unicode/unorm2.h: No such file or directory
In file included from normstrngs.cpp:23:
icuerrorcode.h: In member function ‘virtual void tesseract::IcuErrorCode::handleFailure() const’:
icuerrorcode.h:55: error: ‘errorName’ was not declared in this scope
normstrngs.cpp: In function ‘void tesseract::NormalizeChar32(char32, GenericVector<int>*)’:
normstrngs.cpp:133: error: expected initializer before ‘*’ token
normstrngs.cpp:135: error: ‘class tesseract::IcuErrorCode’ has no member named ‘assertSuccess’
normstrngs.cpp:139: error: ‘nfkc’ was not declared in this scope
normstrngs.cpp:140: error: ‘class tesseract::IcuErrorCode’ has no member named ‘assertSuccess’
normstrngs.cpp: In function ‘bool tesseract::IsWhitespace(char32)’:
normstrngs.cpp:179: error: ‘u_isUWhiteSpace_4_2’ was not declared in this scope
normstrngs.cpp: In function ‘bool tesseract::IsUTF8Whitespace(const char*)’:
normstrngs.cpp:183: warning: comparison between signed and unsigned integer expressions
normstrngs.cpp: In function ‘bool tesseract::IsInterchangeValid(char32)’:
normstrngs.cpp:228: error: ‘u_isISOControl_4_2’ was not declared in this scope
normstrngs.cpp: In function ‘bool tesseract::IsInterchangeValid7BitAscii(char32)’:
normstrngs.cpp:235: error: ‘u_isISOControl_4_2’ was not declared in this scope
normstrngs.cpp: In function ‘char32 tesseract::FullwidthToHalfwidth(char32)’:
normstrngs.cpp:252: error: ‘class tesseract::IcuErrorCode’ has no member named ‘assertSuccess’
make: *** [normstrngs.lo] Error 1
foneworx@myubuntu:/var/www/DM/tess/tesseract-ocr-read-only/training$

zdenko podobny

unread,
Aug 11, 2014, 1:11:21 PM8/11/14
to tesser...@googlegroups.com
It looks like you ignored configure warnings...


Zdenko


Nick White

unread,
Aug 12, 2014, 11:25:35 AM8/12/14
to tesser...@googlegroups.com
Zdenko's right. To help you out more, you seem to have skipped over
this part of the instructions in the Compiling wiki page:

sudo apt-get install libicu-dev # (if you plan to make the training tools)

Nick
> tesseract-ocr/ab62eb7f-2370-4d59-9a98-f1029da73500%40googlegroups.com.
>
> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an email
> to tesseract-oc...@googlegroups.com.
> To post to this group, send email to tesser...@googlegroups.com.
> Visit this group at http://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/msgid/
> tesseract-ocr/
> CAJbzG8zRDSb1xbYdzm8KhfjjHvF%2Bvc%2BwpwVU3ASq0pwj_ykD9g%40mail.gmail.com.

Dovhani Foneworx

unread,
Aug 15, 2014, 3:45:39 AM8/15/14
to tesser...@googlegroups.com
and also when I run

debuild -us -uc 
and
sudo dpkg -i *.deb

I get:


foneworx@herainternal:/var/www/DM/tess/tesseract-3.03$ debuild -us -uc
 dpkg-buildpackage -rfakeroot -D -us -uc
dpkg-buildpackage: set CFLAGS to default value: -g -O2
dpkg-buildpackage: set CPPFLAGS to default value:
dpkg-buildpackage: set LDFLAGS to default value: -Wl,-Bsymbolic-functions
dpkg-buildpackage: set FFLAGS to default value: -g -O2
dpkg-buildpackage: set CXXFLAGS to default value: -g -O2
dpkg-buildpackage: source package tesseract
dpkg-buildpackage: source version 3.03.03-1
dpkg-buildpackage: source changed by Jeff Breidenbach <j...@debian.org>
dpkg-buildpackage: host architecture amd64
dpkg-checkbuilddeps: Unmet build dependencies: libleptonica-dev (>= 1.70~)
dpkg-buildpackage: warning: Build dependencies/conflicts unsatisfied; aborting.
dpkg-buildpackage: warning: (Use -d flag to override.)
debuild: fatal error at line 1340:
dpkg-buildpackage -rfakeroot -D -us -uc failed
foneworx@herainternal:/var/www/DM/tess/tesseract-3.03$ cd ..
foneworx@herainternal:/var/www/DM/tess$ sudo dpkg -i *.deb
[sudo] password for foneworx:
dpkg: error processing *.deb (--install):
 cannot access archive: No such file or directory
Errors were encountered while processing:
 *.deb
foneworx@herainternal:/var/www/DM/tess$


You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/qav8Xw6fttw/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.

zdenko podobny

unread,
Aug 15, 2014, 3:55:02 AM8/15/14
to tesser...@googlegroups.com
  1. if you are using debian tool you should try to use debian focused forum or raise issue at debian ;-) (it looks like do not have problem with tesseract, you have problem to build package from source)
  2. As far as see meesage is clear:

  1. dpkg-checkbuilddeps: Unmet build dependencies: libleptonica-dev (>= 1.70~)
    dpkg-buildpackage: warning: Build dependencies/conflicts unsatisfied; aborting.

Dovhani Foneworx

unread,
Aug 15, 2014, 4:43:47 AM8/15/14
to tesser...@googlegroups.com
Under which circumstances you will  need training tools, when you say "if you plan to use training tools"?.


On Tue, Aug 12, 2014 at 5:25 PM, Nick White <nick....@durham.ac.uk> wrote:
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/qav8Xw6fttw/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.

rahul tiwari

unread,
Mar 26, 2016, 6:56:17 AM3/26/16
to tesseract-ocr



Dear Sir

when i am running "sudo dpkg -i *.deb" last command its showing error ...i am new to tesseract and opencv please suggest me who i can install and link  tesseract with opencv.

dpkg: error processing archive *.deb (--install):
Reply all
Reply to author
Forward
0 new messages