error during installaton of tesseract-3.01 with leptonica-1.69 in ubuntu 13.04

378 views
Skip to first unread message

mama

unread,
Sep 2, 2013, 5:16:04 AM9/2/13
to tesser...@googlegroups.com
Sir,
Please help me
By following the procedure for installing tesseract-3.01 in ubuntu-13.10
Success for Leptonica
For tesseract-3.01
  ./autogen.sh
 ./configure
  make
I get the error

svutil.cpp:89:18: error: 'fork' was not declared in this scope
svutil.cpp:119:28: error: 'execvp' was not declared in this scope
svutil.cpp: In member function 'void SVNetwork::Close()':
svutil.cpp:262:16: error: 'close' was not declared in this scope
svutil.cpp: In constructor 'SVNetwork::SVNetwork(const char*, int)':
svutil.cpp:417:14: error: 'sleep' was not declared in this scope
make[3]: *** [svutil.lo] Error 1
make[3]: Leaving directory `/home/sahu/tesseract-3.01/
viewer'
make[2]: *** [all-recursive] Error 1
make[2]: Leaving directory `/home/sahu/tesseract-3.01/viewer'
make[1]: *** [all-recursive] Error 1
make[1]: Leaving directory `/home/sahu/tesseract-3.01'
make: *** [all] Error 2

Please help me how to proceed further

Nick White

unread,
Sep 2, 2013, 8:16:09 AM9/2/13
to tesser...@googlegroups.com
Why are you trying 3.01? I'd recommend 3.02.02, as it's the current
release.

Tesseract 3.02.02 is already packaged for Ubuntu. You can just do:

apt-get install tesseract-ocr

on the command line, or the equivalent in their "software center",
and it will all be installed for you.

Nick

mama

unread,
Sep 9, 2013, 10:43:58 AM9/9/13
to tesser...@googlegroups.com
sir
you advise me to use tesseract-3.02 but i have already use training procedure using tesseract-3.01 and because of I upgrade the Ubuntu to 13.04 it need to reinstall. So please can you say whether tesseract-3.01 can be installed in ubuntu-13.04

Sven Pedersen

unread,
Sep 9, 2013, 11:02:32 AM9/9/13
to tesser...@googlegroups.com
You can use the traineddata from 3.01 in 3.02, but you might get some benefit from retraining -- it will not require doing the whole procedure from scratch.
--Sven


--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
 
---
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.



--
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

mamata nayak

unread,
Oct 17, 2013, 2:06:07 AM10/17/13
to tesser...@googlegroups.com
Sir
As per your suggestion i use Tesseract-3.02.2. for training.
But i found problem during shapeclustering
Please help me


tesseract-3.02$ shapeclustering -F font_properties -U unicharset LohitOriya.tr
Reading LohitOriya.tr ...
*** glibc detected *** shapeclustering: double free or corruption (!prev): 0x09044170 ***
======= Backtrace: =========
/lib/i386-linux-gnu/libc.so.6(+0x6ebc2)[0x7a0bc2]
/lib/i386-linux-gnu/libc.so.6(+0x6f862)[0x7a1862]
/lib/i386-linux-gnu/libc.so.6(cfree+0x6d)[0x7a494d]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xce980f]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdaPv+0x1b)[0xce986b]
shapeclustering(_ZN13GenericVectorIiE5clearEv+0x5c)[0x804e4ec]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract17TrainingSampleSet14SetupFontIdMapEv+0x113)[0x4ab2e3]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract17TrainingSampleSet22OrganizeByFontAndClassEv+0x24)[0x4adaf4]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract13MasterTrainer24ReplaceFragmentedSamplesEv+0x146)[0x49ca26]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract13MasterTrainer15PostLoadCleanupEv+0x110)[0x49cca0]
shapeclustering[0x804e0c8]
shapeclustering(main+0x50)[0x804b6b0]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0x74b113]
shapeclustering[0x804be39]
======= Memory map: ========
00110000-00123000 r-xp 00000000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00123000-00124000 r--p 00012000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00124000-00125000 rw-p 00013000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00125000-0014d000 r-xp 00000000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
0014d000-0014e000 r--p 00027000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
0014e000-0014f000 rw-p 00028000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
00154000-0016b000 r-xp 00000000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
0016b000-0016c000 r--p 00016000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
0016c000-0016d000 rw-p 00017000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
0016d000-0016f000 rw-p 00000000 00:00 0
0016f000-00191000 r-xp 00000000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
00191000-00192000 r--p 00022000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
00192000-00193000 rw-p 00023000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
00193000-001f3000 r-xp 00000000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
001f3000-001f5000 r--p 0005f000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
001f5000-001f6000 rw-p 00061000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
0022f000-00257000 r-xp 00000000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
00257000-00258000 r--p 00028000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
00258000-00259000 rw-p 00029000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
00295000-002b3000 r-xp 00000000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
002b3000-002b4000 r--p 0001d000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
002b4000-002b5000 rw-p 0001e000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
002b5000-00586000 r-xp 00000000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
00586000-0058a000 r--p 002d1000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
0058a000-0058e000 rw-p 002d5000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
0058e000-00627000 rw-p 00000000 00:00 0
00731000-00732000 r-xp 00000000 00:00 0          [vdso]
00732000-008a8000 r-xp 00000000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
008a8000-008aa000 r--p 00176000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
008aa000-008ab000 rw-p 00178000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
008ab000-008ae000 rw-p 00000000 00:00 0
008f4000-00910000 r-xp 00000000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00910000-00911000 r--p 0001b000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00911000-00912000 rw-p 0001c000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00912000-00ad8000 r-xp 00000000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00ad8000-00ad9000 r--p 001c5000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00ad9000-00ae0000 rw-p 001c6000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00c3c000-00d1a000 r-xp 00000000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
00d1a000-00d1b000 ---p 000de000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
00d1b000-00d1f000 r--p 000de000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
00d1f000-00d20000 rw-p 000e2000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
00d20000-00d27000 rw-p 00000000 00:00 0
08048000-08052000 r-xp 00000000 08:03 504822     /usr/local/bin/shapeclustering
08052000-08053000 r--p 00009000 08:03 504822     /usr/local/bin/shapeclustering
08053000-08054000 rw-p 0000a000 08:03 504822     /usr/local/bin/shapeclustering
08fca000-090d4000 rw-p 00000000 00:00 0          [heap]
b7700000-b7721000 rw-p 00000000 00:00 0
b7721000-b7800000 ---p 00000000 00:00 0
b784e000-b7853000 rw-p 00000000 00:00 0
b7862000-b7864000 rw-p 00000000 00:00 0
bfc39000-bfc5a000 rw-p 00000000 00:00 0          [stack]
Aborted








You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/XmVuM-6AIH8/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

zdenko podobny

unread,
Oct 17, 2013, 2:44:09 AM10/17/13
to tesser...@googlegroups.com
On Thu, Oct 17, 2013 at 8:06 AM, mamata nayak <mamat...@gmail.com> wrote:
Sir
As per your suggestion i use Tesseract-3.02.2. for training.
But i found problem during shapeclustering
Please help me


tesseract-3.02$ shapeclustering -F font_properties -U unicharset LohitOriya.tr
Reading LohitOriya.tr ...
*** glibc detected *** shapeclustering: double free or corruption (!prev): 0x09044170 ***

Your command reveals that you do not care about official training instruction. So you are alone with your problems because of your decision ;-). 
Follow official instructions[1] closely and you will have less problems.




Zdenko

Shree Devi Kumar

unread,
Oct 17, 2013, 3:00:01 AM10/17/13
to tesser...@googlegroups.com
​​Please read the training page referred by zdenko and follow instructions.

Your training file name should follow the format 
lang.fontname.exp0.tr 
There should be consistency in naming within font_properties file, file names and the commands that you use. 

Clustering

When the character features of all the training pages have been extracted, we need to cluster them to create the prototypes. The character shape features can be clustered using the shapeclustering (available from 3.02 version), mftraining and cntraining programs:

shapeclustering -F font_properties -U unicharset lang.fontname.exp0.tr lang.fontname.exp1.tr ...

shapeclustering creates the master shape table by shape clustering and writes it to a file - shapetable. It currently should not be used except for the Indic languages.



Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


--

mamata nayak

unread,
Oct 17, 2013, 3:30:50 AM10/17/13
to tesser...@googlegroups.com
Even after renaming all files same error occur.

Can you suggest me please, should i use tesseract-3.01 or tesseract-3.02 for odia language. 


You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/XmVuM-6AIH8/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

Sriranga(79yrs)

unread,
Oct 17, 2013, 3:59:46 AM10/17/13
to tesser...@googlegroups.com
In ubuntu 13.10 -  tesseract 3.02 is available as default in ubuntu software centre.

mamata nayak

unread,
Oct 17, 2013, 4:39:59 AM10/17/13
to tesser...@googlegroups.com
Sir
I have already use tesseract-3.01 for my language (indic).
please suggest me should i go for tessract-3.01 or  3.02
Reply all
Reply to author
Forward
0 new messages