shapeclustering

42 views
Skip to first unread message

mama

unread,
Oct 17, 2013, 3:12:05 AM10/17/13
to tesser...@googlegroups.com
I am success in training tesseract-3.01 by following the official instruction[1].
Now i use the same tif and box file, generate .tr file and rename, create font_properties file and generate unicharset file.

.
During shapeclustering i found the following error'
So please suggest me what I will do???????????

Errors in shapeclustering

$ shapeclustering -F font_properties -U unicharset LohitOriya.trReading LohitOriya.tr ...
*** glibc detected *** shapeclustering: double free or corruption (!prev): 0x08e43170 ***
======= Backtrace: =========
/lib/i386-linux-gnu/libc.so.6(+0x6ebc2)[0xb68bc2]
/lib/i386-linux-gnu/libc.so.6(+0x6f862)[0xb69862]
/lib/i386-linux-gnu/libc.so.6(cfree+0x6d)[0xb6c94d]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0x74880f]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdaPv+0x1b)[0x74886b]
shapeclustering(_ZN13GenericVectorIiE5clearEv+0x5c)[0x804e4ec]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract17TrainingSampleSet14SetupFontIdMapEv+0x113)[0x3062e3]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract17TrainingSampleSet22OrganizeByFontAndClassEv+0x24)[0x308af4]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract13MasterTrainer24ReplaceFragmentedSamplesEv+0x146)[0x2f7a26]
/usr/local/lib/libtesseract.so.3(_ZN9tesseract13MasterTrainer15PostLoadCleanupEv+0x110)[0x2f7ca0]
shapeclustering[0x804e0c8]
shapeclustering(main+0x50)[0x804b6b0]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xb13113]
shapeclustering[0x804be39]
======= Memory map: ========
00110000-003e1000 r-xp 00000000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
003e1000-003e5000 r--p 002d1000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
003e5000-003e9000 rw-p 002d5000 08:03 504810     /usr/local/lib/libtesseract.so.3.0.2
003e9000-00482000 rw-p 00000000 00:00 0
00482000-00495000 r-xp 00000000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00495000-00496000 r--p 00012000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00496000-00497000 rw-p 00013000 08:03 2909176    /lib/i386-linux-gnu/libz.so.1.2.3.4
00497000-004bf000 r-xp 00000000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
004bf000-004c0000 r--p 00027000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
004c0000-004c1000 rw-p 00028000 08:03 2908164    /lib/i386-linux-gnu/libpng12.so.0.46.0
004c1000-004e3000 r-xp 00000000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
004e3000-004e4000 r--p 00022000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
004e4000-004e5000 rw-p 00023000 08:03 1245488    /usr/lib/i386-linux-gnu/libjpeg.so.62.0.0
005c9000-00629000 r-xp 00000000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
00629000-0062b000 r--p 0005f000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
0062b000-0062c000 rw-p 00061000 08:03 426185     /usr/lib/i386-linux-gnu/libtiff.so.4.3.4
0069b000-00779000 r-xp 00000000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
00779000-0077a000 ---p 000de000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
0077a000-0077e000 r--p 000de000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
0077e000-0077f000 rw-p 000e2000 08:03 1245598    /usr/lib/i386-linux-gnu/libstdc++.so.6.0.16
0077f000-00786000 rw-p 00000000 00:00 0
0079e000-00964000 r-xp 00000000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00964000-00965000 r--p 001c5000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00965000-0096c000 rw-p 001c6000 08:03 501220     /usr/local/lib/liblept.so.3.0.0
00aa8000-00ac6000 r-xp 00000000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
00ac6000-00ac7000 r--p 0001d000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
00ac7000-00ac8000 rw-p 0001e000 08:03 2909080    /lib/i386-linux-gnu/ld-2.13.so
00ac9000-00ae0000 r-xp 00000000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
00ae0000-00ae1000 r--p 00016000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
00ae1000-00ae2000 rw-p 00017000 08:03 2909153    /lib/i386-linux-gnu/libpthread-2.13.so
00ae2000-00ae4000 rw-p 00000000 00:00 0
00afa000-00c70000 r-xp 00000000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
00c70000-00c72000 r--p 00176000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
00c72000-00c73000 rw-p 00178000 08:03 2909093    /lib/i386-linux-gnu/libc-2.13.so
00c73000-00c76000 rw-p 00000000 00:00 0
00e6a000-00e86000 r-xp 00000000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00e86000-00e87000 r--p 0001b000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00e87000-00e88000 rw-p 0001c000 08:03 2909114    /lib/i386-linux-gnu/libgcc_s.so.1
00f22000-00f23000 r-xp 00000000 00:00 0          [vdso]
00fbe000-00fe6000 r-xp 00000000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
00fe6000-00fe7000 r--p 00028000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
00fe7000-00fe8000 rw-p 00029000 08:03 2909123    /lib/i386-linux-gnu/libm-2.13.so
08048000-08052000 r-xp 00000000 08:03 504822     /usr/local/bin/shapeclustering
08052000-08053000 r--p 00009000 08:03 504822     /usr/local/bin/shapeclustering
08053000-08054000 rw-p 0000a000 08:03 504822     /usr/local/bin/shapeclustering
08dc9000-08ed3000 rw-p 00000000 00:00 0          [heap]
b7700000-b7721000 rw-p 00000000 00:00 0
b7721000-b7800000 ---p 00000000 00:00 0
b781c000-b7821000 rw-p 00000000 00:00 0
b7830000-b7832000 rw-p 00000000 00:00 0
bfa90000-bfab1000 rw-p 00000000 00:00 0          [stack]
Aborted

Shree Devi Kumar

unread,
Oct 17, 2013, 4:25:35 AM10/17/13
to tesser...@googlegroups.com
There are changes in the training process from 3.01 to 3.02. In 3.02

Assuming that you have only one box/tiff pair and one tr file called
odia.lohit.exp0.tif
odia.lohit.exp0.box
odia.lohit.exp0.tr
that you have defined lohit as the font in your font_properties file as follows
lohit 0 0 0 0 0

 your commands should be on the following lines:

unicharset_extractor  odia.lohit.exp0.box
shapeclustering -F font_properties -U unicharset odia.lohit.exp0.tr
mftraining  -F font_properties -U unicharset -O odia.unicharset odia.lohit.exp0.tr
cntraining  odia.lohit.exp0.tr
mv normproto odia.normproto
mv inttemp odia.inttemp
mv pffmtable  odia.pffmtable
mv shapetable odia.shapetable
combine_tessdata .\odia.

 
 


Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
 
---
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

mamata nayak

unread,
Oct 17, 2013, 4:47:04 AM10/17/13
to tesser...@googlegroups.com
sir,
can u suggest me should I use tesseract-3.01 or 3.02
another, already i have install 3.02, is it required to install 3.01
pls reply  


You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/6u9oVazAIPE/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages