Error by Training

92 views
Skip to first unread message

BenLie

unread,
Dec 4, 2015, 11:24:30 AM12/4/15
to tesseract-ocr
Hello,

I'm new to tesseract. 
I use Tesseract 3.03 on Windows 8. 

I have a custom font named BenLie. I created a tif and box file with jTessBoxEditor. 

When I run the training i get an error, see fonterror.PNG. 

Then I tried to run the training with the console an get this:

D:\Downloads\jTessBoxEditor-1.4\jTessBoxEditor\samples\BenLie>shapeclustering.exe -F deu.font_properties -U unicharset deu.benlie.exp0.tr
Reading deu.benlie.exp0.tr ...
Font id = -1/0, class id = 1/77 on sample 0
font_id >= 0 && font_id < font_id_map_.SparseSize():Error:Assert failed:in file
..\..\classify\trainingsampleset.cpp, line 622

 My deu.font_properties contains

benlie 0 0 0 0 0
benlieb 0 1 0 0 0
benliebi 1 1 0 0 0
benliei 1 0 0 0

What am I doing wrong? 
Is this the right content for the font property file?

Best regards 
Benjamin
fonterror.PNG

Quan Nguyen

unread,
Dec 5, 2015, 9:56:16 AM12/5/15
to tesseract-ocr
The font_properties file looks correct. Apparently, shapeclustering has produced an exception. The image shows a bad file name "deu.benlie.expo.box.tr"; it should be "deu.benlie.expo.tr". So please delete the bad file and correct the file name of your input image and try again.
Reply all
Reply to author
Forward
0 new messages