Error:Assert failed:in file stringrenderer.cpp, line 546


chadme...@gmail.com

Jan 30, 2018, 3:40:08 PM
to tesseract-ocr
I got an error when running the following command:

training/tesstrain.sh --fonts_dir /usr/share/fonts --lang chi_sim --linedata_only \
  --noextract_font_properties --langdata_dir langdata \
  --tessdata_dir tessdata --fontlist "AR PL UKai CN" \
  --output_dir /home/chad/tesseract40/tesstutorial/chi_sim_test/chi_sim_0004X \
  --training_text /home/chad/tesseract40/files/chi_sim_0004X.txt

Error detail:

=== Starting training for language 'chi_sim'
[2018年 01月 31日 星期三 03:54:16 CST] /usr/local/bin/text2image --fonts_dir=/usr/share/fonts 
--font=AR PL UKai CN --outputbase=/tmp/font_tmp.JoNjc7bJC4/sample_text.txt 
--text=/tmp/font_tmp.JoNjc7bJC4/sample_text.txt --fontconfig_tmpdir=/tmp/font_tmp.JoNjc7bJC4
Rendered page 0 to file /tmp/font_tmp.JoNjc7bJC4/sample_text.txt.tif

=== Phase I: Generating training images ===
Rendering using AR PL UKai CN
[2018年 01月 31日 星期三 03:54:30 CST] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.JoNjc7bJC4 --fonts_dir=/usr/share/fonts 
--strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0 
--max_pages=1000 --font=AR PL UKai CN --text=/home/chad/tesseract40/files/chi_sim_0004X.txt
Rendered page 0 to file /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.tif
Rendered page 1 to file /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.tif
Rendered page 2 to file /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.tif
Rendered page 3 to file /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.tif
Rendered page 4 to file /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.tif
cluster_text.size() == start_byte_to_box.size():Error:Assert failed:in file stringrenderer.cpp, line 546
ERROR: /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.box does not exist or is not readable
ERROR: /tmp/tmp.vOCQupVasn/chi_sim/chi_sim.AR_PL_UKai_CN.exp0.box does not exist or is not readable

The Tesseract version is 4.0.

How can I fix this?
Attachment: chi_sim_0004X.txt