+ /home/ubuntu/tesseract/src/training/tesstrain.sh --fonts_dir ../.fonts --lang sin --linedata_only --noextract_font_properties --langdata_dir ../langdata_lstm --tessdata_dir ../tessdata_best --fontlist FreeSerif --training_text ../langdata_lstm/sin/sin.training_text --workspace_dir /home/ubuntu/tmp/ --save_box_tiff --maxpages 1 --output_dir ../tesstutorial/sintest
=== Starting training for language 'sin'
[Tue Sep 4 03:21:08 UTC 2018] /home/ubuntu/tesseract/src/training/text2image --fonts_dir=../.fonts --font=FreeSerif --outputbase=/home/ubuntu/tmp//fc-cache/sample_text.txt --text=/home/ubuntu/tmp//fc-cache/sample_text.txt --fontconfig_tmpdir=/home/ubuntu/tmp//fc-cache
Rendered page 0 to file /home/ubuntu/tmp//fc-cache/sample_text.txt.tif
=== Phase I: Generating training images ===
Rendering using FreeSerif
[Tue Sep 4 03:21:10 UTC 2018] /home/ubuntu/tesseract/src/training/text2image --fontconfig_tmpdir=/home/ubuntu/tmp//fc-cache --fonts_dir=../.fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/sin-2018-09-04.Wa5/sin.FreeSerif.exp0 --max_pages=1 --font=FreeSerif --text=../langdata_lstm/sin/sin.training_text
Stripped 1 unrenderable words
Rendered page 0 to file /tmp/sin-2018-09-04.Wa5/sin.FreeSerif.exp0.tif
=== Phase UP: Generating unicharset and unichar properties files ===
[Tue Sep 4 03:21:11 UTC 2018] /home/ubuntu/tesseract/src/training/unicharset_extractor --output_unicharset /tmp/sin-2018-09-04.Wa5/sin.unicharset --norm_mode 2 /tmp/sin-2018-09-04.Wa5/sin.FreeSerif.exp0.box
Extracting unicharset from box file /tmp/sin-2018-09-04.Wa5/sin.FreeSerif.exp0.box
Wrote unicharset file /tmp/sin-2018-09-04.Wa5/sin.unicharset
[Tue Sep 4 03:21:11 UTC 2018] /home/ubuntu/tesseract/src/training/set_unicharset_properties -U /tmp/sin-2018-09-04.Wa5/sin.unicharset -O /tmp/sin-2018-09-04.Wa5/sin.unicharset -X /tmp/sin-2018-09-04.Wa5/sin.xheights --script_dir=../langdata_lstm
Loaded unicharset of size 111 from file /tmp/sin-2018-09-04.Wa5/sin.unicharset