Error in training Tesseract 4.0. Training gets completed somehow but then the output it gives after reading the pdf is incorrect.

602 views
Skip to first unread message

ada...@turningcloud.com

unread,
Feb 15, 2018, 2:22:07 AM2/15/18
to tesseract-ocr
adarsh@adarsh-X555LJ:~/tesseract$ training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng   --noextract_font_properties --langdata_dir /home/adarsh/tesseract/langdata --training_text /home/adarsh/tesseract/langdata/eng/eng.training_text --linedata_only   --tessdata_dir /home/tessdata/tessdata --output_dir ~/tesstutorial/engtrain  --overwrite

=== Starting training for language 'eng'
[Thu Feb 15 11:56:06 IST 2018] /usr/local/bin/text2image --fonts_dir=/usr/share/fonts --font=Arial Bold --outputbase=/tmp/font_tmp.zQ3JffkHYN/sample_text.txt --text=/tmp/font_tmp.zQ3JffkHYN/sample_text.txt --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN
Rendered page 0 to file /tmp/font_tmp.zQ3JffkHYN/sample_text.txt.tif

=== Phase I: Generating training images ===
Rendering using Arial Bold
Rendering using Arial Italic
Rendering using Arial
Rendering using Courier New Bold Italic
Rendering using Courier New
Rendering using Courier New Italic
Rendering using Courier New Bold
Rendering using Arial Bold Italic
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0 --max_pages=3 --font=Courier New Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0 --max_pages=3 --font=Arial --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0 --max_pages=3 --font=Arial Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0 --max_pages=3 --font=Arial Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0 --max_pages=3 --font=Courier New --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0 --max_pages=3 --font=Courier New Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0 --max_pages=3 --font=Arial Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0 --max_pages=3 --font=Courier New Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.tif
Rendering using Times New Roman, Bold Italic
Rendering using Times New Roman, Italic
Rendering using Times New Roman,
Rendering using Times New Roman, Bold
Rendering using Georgia Bold
Rendering using Georgia Bold Italic
Rendering using Georgia Italic
Rendering using Georgia
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0 --max_pages=3 --font=Times New Roman, Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0 --max_pages=3 --font=Times New Roman, --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0 --max_pages=3 --font=Times New Roman, Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0 --max_pages=3 --font=Georgia Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0 --max_pages=3 --font=Georgia Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0 --max_pages=3 --font=Georgia Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0 --max_pages=3 --font=Georgia --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0 --max_pages=3 --font=Times New Roman, Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif
Rendering using Trebuchet MS Bold
Rendering using Trebuchet MS Bold Italic
Rendering using Verdana Bold
Rendering using Verdana Bold Italic
Rendering using Trebuchet MS
Rendering using Trebuchet MS Italic
Rendering using Verdana
Rendering using Verdana Italic
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0 --max_pages=3 --font=Trebuchet MS Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0 --max_pages=3 --font=Trebuchet MS Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0 --max_pages=3 --font=Trebuchet MS Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0 --max_pages=3 --font=Trebuchet MS --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0 --max_pages=3 --font=Verdana Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0 --max_pages=3 --font=Verdana Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0 --max_pages=3 --font=Verdana --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0 --max_pages=3 --font=Verdana Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.tif
Rendering using URW Bookman L Italic
Rendering using Century Schoolbook L Italic
Rendering using URW Bookman L Bold Italic
Rendering using Century Schoolbook L Bold Italic
Rendering using URW Bookman L Bold
Rendering using Century Schoolbook L Bold
Rendering using Century Schoolbook L Medium
Rendering using DejaVu Sans Ultra-Light
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0 --max_pages=3 --font=Century Schoolbook L Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0 --max_pages=3 --font=URW Bookman L Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0 --max_pages=3 --font=Century Schoolbook L Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0 --max_pages=3 --font=URW Bookman L Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0 --max_pages=3 --font=URW Bookman L Bold --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0 --max_pages=3 --font=Century Schoolbook L Medium --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 --max_pages=3 --font=Century Schoolbook L Bold Italic --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
[Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0 --max_pages=3 --font=DejaVu Sans Ultra-Light --text=/home/adarsh/tesseract/langdata/eng/eng.training_text
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.tif

=== Phase UP: Generating unicharset and unichar properties files ===
[Thu Feb 15 11:57:00 IST 2018] /usr/local/bin/unicharset_extractor --output_unicharset /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset --norm_mode 1 /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.box
Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.box
Other case É of é is not in unicharset
Wrote unicharset file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset
[Thu Feb 15 11:57:00 IST 2018] /usr/local/bin/set_unicharset_properties -U /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset -O /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset -X /tmp/tmp.kisZVM4Xbo/eng/eng.xheights --script_dir=/home/adarsh/tesseract/langdata
Loaded unicharset of size 111 from file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset
Setting unichar properties
Other case É of é is not in unicharset
Setting script properties
Failed to load script unicharset from:/home/adarsh/tesseract/langdata/Latin.unicharset
Warning: properties incomplete for index 3 = d
Warning: properties incomplete for index 4 = i
Warning: properties incomplete for index 5 = f
Warning: properties incomplete for index 6 = e
Warning: properties incomplete for index 7 = r
Warning: properties incomplete for index 8 = n
Warning: properties incomplete for index 9 = t
Warning: properties incomplete for index 10 = N
Warning: properties incomplete for index 11 = w
Warning: properties incomplete for index 12 = A
Warning: properties incomplete for index 13 = c
Warning: properties incomplete for index 14 = l
Warning: properties incomplete for index 15 = s
Warning: properties incomplete for index 16 = p
Warning: properties incomplete for index 17 = a
Warning: properties incomplete for index 18 = g
Warning: properties incomplete for index 19 = 2
Warning: properties incomplete for index 20 = 3
Warning: properties incomplete for index 21 = T
Warning: properties incomplete for index 22 = o
Warning: properties incomplete for index 23 = S
Warning: properties incomplete for index 24 = v
Warning: properties incomplete for index 25 = ~
Warning: properties incomplete for index 26 = D
Warning: properties incomplete for index 27 = C
Warning: properties incomplete for index 28 = h
Warning: properties incomplete for index 29 = '
Warning: properties incomplete for index 30 = 7
Warning: properties incomplete for index 31 = «
Warning: properties incomplete for index 32 = :
Warning: properties incomplete for index 33 = #
Warning: properties incomplete for index 34 = 1
Warning: properties incomplete for index 35 = Z
Warning: properties incomplete for index 36 = _
Warning: properties incomplete for index 37 = M
Warning: properties incomplete for index 38 = u
Warning: properties incomplete for index 39 = m
Warning: properties incomplete for index 40 = P
Warning: properties incomplete for index 41 = H
Warning: properties incomplete for index 42 = O
Warning: properties incomplete for index 43 = (
Warning: properties incomplete for index 44 = )
Warning: properties incomplete for index 45 = q
Warning: properties incomplete for index 46 = y
Warning: properties incomplete for index 47 = |
Warning: properties incomplete for index 48 = U
Warning: properties incomplete for index 49 = 0
Warning: properties incomplete for index 50 = %
Warning: properties incomplete for index 51 = x
Warning: properties incomplete for index 52 = F
Warning: properties incomplete for index 53 = R
Warning: properties incomplete for index 54 = I
Warning: properties incomplete for index 55 = ,
Warning: properties incomplete for index 56 = !
Warning: properties incomplete for index 57 = E
Warning: properties incomplete for index 58 = b
Warning: properties incomplete for index 59 = \
Warning: properties incomplete for index 60 = 8
Warning: properties incomplete for index 61 = ?
Warning: properties incomplete for index 62 = &
Warning: properties incomplete for index 63 = ;
Warning: properties incomplete for index 64 = B
Warning: properties incomplete for index 65 = k
Warning: properties incomplete for index 66 = -
Warning: properties incomplete for index 67 = >
Warning: properties incomplete for index 68 = L
Warning: properties incomplete for index 69 = .
Warning: properties incomplete for index 70 = —
Warning: properties incomplete for index 71 = 4
Warning: properties incomplete for index 72 = »
Warning: properties incomplete for index 73 = €
Warning: properties incomplete for index 74 = W
Warning: properties incomplete for index 75 = J
Warning: properties incomplete for index 76 = é
Warning: properties incomplete for index 77 = 9
Warning: properties incomplete for index 78 = ®
Warning: properties incomplete for index 79 = $
Warning: properties incomplete for index 80 = 5
Warning: properties incomplete for index 81 = }
Warning: properties incomplete for index 82 = [
Warning: properties incomplete for index 83 = Y
Warning: properties incomplete for index 84 = §
Warning: properties incomplete for index 85 = "
Warning: properties incomplete for index 86 = {
Warning: properties incomplete for index 87 = ¢
Warning: properties incomplete for index 88 = /
Warning: properties incomplete for index 89 = Q
Warning: properties incomplete for index 90 = 6
Warning: properties incomplete for index 91 = G
Warning: properties incomplete for index 92 = ”
Warning: properties incomplete for index 93 = °
Warning: properties incomplete for index 94 = K
Warning: properties incomplete for index 95 = ¥
Warning: properties incomplete for index 96 = V
Warning: properties incomplete for index 97 = ©
Warning: properties incomplete for index 98 = z
Warning: properties incomplete for index 99 = +
Warning: properties incomplete for index 100 = =
Warning: properties incomplete for index 101 = £
Warning: properties incomplete for index 102 = <
Warning: properties incomplete for index 103 = ’
Warning: properties incomplete for index 104 = ‘
Warning: properties incomplete for index 105 = j
Warning: properties incomplete for index 106 = X
Warning: properties incomplete for index 107 = ]
Warning: properties incomplete for index 108 = *
Warning: properties incomplete for index 109 = “
Warning: properties incomplete for index 110 = @
Writing unicharset to file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset

=== Phase E: Generating lstmf files ===
Using TESSDATA_PREFIX=/home/tessdata/tessdata
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0 lstm.train
[Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0 lstm.train
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 45/45 pages (1-45) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.lstmf
Loaded 47/47 pages (1-47) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.lstmf
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0 lstm.train
[Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0 lstm.train
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 1
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.lstmf
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0 lstm.train
[Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0 lstm.train
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.lstmf
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0 lstm.train
[Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0 lstm.train
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.lstmf
Loaded 47/47 pages (1-47) of document /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.lstmf
Loaded 48/48 pages (1-48) of document /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.lstmf

=== Constructing LSTM training data ===
[Thu Feb 15 11:57:36 IST 2018] /usr/local/bin/combine_lang_model --input_unicharset /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset --script_dir /home/adarsh/tesseract/langdata --words /home/adarsh/tesseract/langdata/eng/eng.wordlist --numbers /home/adarsh/tesseract/langdata/eng/eng.numbers --puncs /home/adarsh/tesseract/langdata/eng/eng.punc --output_dir /home/adarsh/tesstutorial/engtrain --lang eng
Loaded unicharset of size 111 from file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset
Setting unichar properties
Other case É of é is not in unicharset
Setting script properties
Failed to load script unicharset from:/home/adarsh/tesseract/langdata/Latin.unicharset
Warning: properties incomplete for index 3 = d
Warning: properties incomplete for index 4 = i
Warning: properties incomplete for index 5 = f
Warning: properties incomplete for index 6 = e
Warning: properties incomplete for index 7 = r
Warning: properties incomplete for index 8 = n
Warning: properties incomplete for index 9 = t
Warning: properties incomplete for index 10 = N
Warning: properties incomplete for index 11 = w
Warning: properties incomplete for index 12 = A
Warning: properties incomplete for index 13 = c
Warning: properties incomplete for index 14 = l
Warning: properties incomplete for index 15 = s
Warning: properties incomplete for index 16 = p
Warning: properties incomplete for index 17 = a
Warning: properties incomplete for index 18 = g
Warning: properties incomplete for index 19 = 2
Warning: properties incomplete for index 20 = 3
Warning: properties incomplete for index 21 = T
Warning: properties incomplete for index 22 = o
Warning: properties incomplete for index 23 = S
Warning: properties incomplete for index 24 = v
Warning: properties incomplete for index 25 = ~
Warning: properties incomplete for index 26 = D
Warning: properties incomplete for index 27 = C
Warning: properties incomplete for index 28 = h
Warning: properties incomplete for index 29 = '
Warning: properties incomplete for index 30 = 7
Warning: properties incomplete for index 31 = «
Warning: properties incomplete for index 32 = :
Warning: properties incomplete for index 33 = #
Warning: properties incomplete for index 34 = 1
Warning: properties incomplete for index 35 = Z
Warning: properties incomplete for index 36 = _
Warning: properties incomplete for index 37 = M
Warning: properties incomplete for index 38 = u
Warning: properties incomplete for index 39 = m
Warning: properties incomplete for index 40 = P
Warning: properties incomplete for index 41 = H
Warning: properties incomplete for index 42 = O
Warning: properties incomplete for index 43 = (
Warning: properties incomplete for index 44 = )
Warning: properties incomplete for index 45 = q
Warning: properties incomplete for index 46 = y
Warning: properties incomplete for index 47 = |
Warning: properties incomplete for index 48 = U
Warning: properties incomplete for index 49 = 0
Warning: properties incomplete for index 50 = %
Warning: properties incomplete for index 51 = x
Warning: properties incomplete for index 52 = F
Warning: properties incomplete for index 53 = R
Warning: properties incomplete for index 54 = I
Warning: properties incomplete for index 55 = ,
Warning: properties incomplete for index 56 = !
Warning: properties incomplete for index 57 = E
Warning: properties incomplete for index 58 = b
Warning: properties incomplete for index 59 = \
Warning: properties incomplete for index 60 = 8
Warning: properties incomplete for index 61 = ?
Warning: properties incomplete for index 62 = &
Warning: properties incomplete for index 63 = ;
Warning: properties incomplete for index 64 = B
Warning: properties incomplete for index 65 = k
Warning: properties incomplete for index 66 = -
Warning: properties incomplete for index 67 = >
Warning: properties incomplete for index 68 = L
Warning: properties incomplete for index 69 = .
Warning: properties incomplete for index 70 = —
Warning: properties incomplete for index 71 = 4
Warning: properties incomplete for index 72 = »
Warning: properties incomplete for index 73 = €
Warning: properties incomplete for index 74 = W
Warning: properties incomplete for index 75 = J
Warning: properties incomplete for index 76 = é
Warning: properties incomplete for index 77 = 9
Warning: properties incomplete for index 78 = ®
Warning: properties incomplete for index 79 = $
Warning: properties incomplete for index 80 = 5
Warning: properties incomplete for index 81 = }
Warning: properties incomplete for index 82 = [
Warning: properties incomplete for index 83 = Y
Warning: properties incomplete for index 84 = §
Warning: properties incomplete for index 85 = "
Warning: properties incomplete for index 86 = {
Warning: properties incomplete for index 87 = ¢
Warning: properties incomplete for index 88 = /
Warning: properties incomplete for index 89 = Q
Warning: properties incomplete for index 90 = 6
Warning: properties incomplete for index 91 = G
Warning: properties incomplete for index 92 = ”
Warning: properties incomplete for index 93 = °
Warning: properties incomplete for index 94 = K
Warning: properties incomplete for index 95 = ¥
Warning: properties incomplete for index 96 = V
Warning: properties incomplete for index 97 = ©
Warning: properties incomplete for index 98 = z
Warning: properties incomplete for index 99 = +
Warning: properties incomplete for index 100 = =
Warning: properties incomplete for index 101 = £
Warning: properties incomplete for index 102 = <
Warning: properties incomplete for index 103 = ’
Warning: properties incomplete for index 104 = ‘
Warning: properties incomplete for index 105 = j
Warning: properties incomplete for index 106 = X
Warning: properties incomplete for index 107 = ]
Warning: properties incomplete for index 108 = *
Warning: properties incomplete for index 109 = “
Warning: properties incomplete for index 110 = @
Config file is optional, continuing...
Failed to read data from: /home/adarsh/tesseract/langdata/eng/eng.config
Failed to read data from: /home/adarsh/tesseract/langdata/radical-stroke.txt
Error reading radical code table /home/adarsh/tesseract/langdata/radical-stroke.txt
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain

Completed training for language 'eng'

ShreeDevi Kumar

unread,
Feb 15, 2018, 3:05:15 AM2/15/18
to tesser...@googlegroups.com
You are missing langdata files

Failed to load script unicharset from:/home/adarsh/tesseract/langdata/Latin.unicharset

Failed to read data from: /home/adarsh/tesseract/langdata/radical-stroke.txt
Error reading radical code table /home/adarsh/tesseract/langdata/radical-stroke.txt

Even after you fix the above, this is only first step of LSTM training process. 

It creates a starter traineddata and lstmf files to be used by lstmtraining. 

The starter traineddata cannot be used to OCR.

Please read wiki pages regarding training 4.0



ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/baa11e4f-b5a8-42cf-827c-6901073af746%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Adarsh Shukla

unread,
Feb 15, 2018, 5:07:13 AM2/15/18
to tesser...@googlegroups.com
Thanks alot for replying shree.
I will be asking more doubtsin future because of people like you.
Ill revert back if the problem still exists. Thanks a lot.

Regards

Adarsh

REGARDS
ADARSH SHUKLA
Junior Developer Trainee
TURNING CLOUD SOLUTIONS
+91 9717783099

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/rUcvZ_AhCMI/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

Adarsh Shukla

unread,
Feb 15, 2018, 5:17:47 AM2/15/18
to tesser...@googlegroups.com
adarsh@adarsh-X555LJ:~/tesseract$ training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng   --noextract_font_properties --langdata_dir /home/adarsh/tesseract/langdata --training_text /home/adarsh/tesseract/langdata/eng.training_text --linedata_only   --tessdata_dir /home/tessdata/tessdata --output_dir ~/tesstutorial/engtrain  --overwrite


=== Starting training for language 'eng'
[Thu Feb 15 15:41:35 IST 2018] /usr/local/bin/text2image --fonts_dir=/usr/share/fonts --font=Arial Bold --outputbase=/tmp/font_tmp.728VutTJdy/sample_text.txt --text=/tmp/font_tmp.728VutTJdy/sample_text.txt --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy
Rendered page 0 to file /tmp/font_tmp.728VutTJdy/sample_text.txt.tif


=== Phase I: Generating training images ===
Rendering using Arial Bold Italic
Rendering using Arial
Rendering using Arial Bold

Rendering using Courier New Bold Italic
Rendering using Courier New
Rendering using Courier New Italic
Rendering using Arial Italic

Rendering using Courier New Bold
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0 --max_pages=3 --font=Courier New --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0 --max_pages=3 --font=Courier New Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0 --max_pages=3 --font=Arial --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0 --max_pages=3 --font=Courier New Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0 --max_pages=3 --font=Arial Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0 --max_pages=3 --font=Arial Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0 --max_pages=3 --font=Arial Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:41:56 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0 --max_pages=3 --font=Courier New Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
Fontconfig error: line 1: no element found
Fontconfig error: Cannot load default config file
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.tif

Rendering using Times New Roman, Bold Italic
Rendering using Times New Roman,
Rendering using Times New Roman, Bold
Rendering using Georgia Bold Italic
Rendering using Georgia
Rendering using Georgia Italic

Rendering using Times New Roman, Italic
Rendering using Georgia Bold
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0 --max_pages=3 --font=Times New Roman, Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0 --max_pages=3 --font=Georgia Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0 --max_pages=3 --font=Times New Roman, --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0 --max_pages=3 --font=Georgia --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0 --max_pages=3 --font=Times New Roman, Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0 --max_pages=3 --font=Georgia Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0 --max_pages=3 --font=Times New Roman, Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:05 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0 --max_pages=3 --font=Georgia Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.tif

Rendering using Trebuchet MS Bold
Rendering using Verdana Bold
Rendering using Trebuchet MS Italic

Rendering using Verdana Bold Italic
Rendering using Trebuchet MS Bold Italic
Rendering using Trebuchet MS

Rendering using Verdana
Rendering using Verdana Italic
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0 --max_pages=3 --font=Trebuchet MS Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0 --max_pages=3 --font=Trebuchet MS --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0 --max_pages=3 --font=Trebuchet MS Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0 --max_pages=3 --font=Verdana --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0 --max_pages=3 --font=Verdana Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0 --max_pages=3 --font=Verdana Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0 --max_pages=3 --font=Verdana Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:13 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0 --max_pages=3 --font=Trebuchet MS Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.tif

Rendering using URW Bookman L Italic
Rendering using URW Bookman L Bold Italic
Rendering using Century Schoolbook L Bold Italic
Rendering using Century Schoolbook L Italic
Rendering using Century Schoolbook L Bold
Rendering using DejaVu Sans Ultra-Light

Rendering using Century Schoolbook L Medium
Rendering using URW Bookman L Bold
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0 --max_pages=3 --font=URW Bookman L Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0 --max_pages=3 --font=URW Bookman L Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0 --max_pages=3 --font=Century Schoolbook L Bold --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 --max_pages=3 --font=Century Schoolbook L Bold Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0 --max_pages=3 --font=Century Schoolbook L Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0 --max_pages=3 --font=URW Bookman L Italic --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0 --max_pages=3 --font=Century Schoolbook L Medium --text=/home/adarsh/tesseract/langdata/eng.training_text
[Thu Feb 15 15:42:22 IST 2018] /usr/local/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.728VutTJdy --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0 --max_pages=3 --font=DejaVu Sans Ultra-Light --text=/home/adarsh/tesseract/langdata/eng.training_text
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.tif
Rendered page 0 to file /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.tif
Rendered page 1 to file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.tif


=== Phase UP: Generating unicharset and unichar properties files ===
[Thu Feb 15 15:42:30 IST 2018] /usr/local/bin/unicharset_extractor --output_unicharset /tmp/tmp.QzzkE63Ons/eng/eng.unicharset --norm_mode 1 /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.box /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.box
Extracting unicharset from box file /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.box

Other case É of é is not in unicharset
Wrote unicharset file /tmp/tmp.QzzkE63Ons/eng/eng.unicharset
[Thu Feb 15 15:42:30 IST 2018] /usr/local/bin/set_unicharset_properties -U /tmp/tmp.QzzkE63Ons/eng/eng.unicharset -O /tmp/tmp.QzzkE63Ons/eng/eng.unicharset -X /tmp/tmp.QzzkE63Ons/eng/eng.xheights --script_dir=/home/adarsh/tesseract/langdata
Loaded unicharset of size 111 from file /tmp/tmp.QzzkE63Ons/eng/eng.unicharset
Writing unicharset to file /tmp/tmp.QzzkE63Ons/eng/eng.unicharset


=== Phase E: Generating lstmf files ===
Using TESSDATA_PREFIX=/home/tessdata/tessdata
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0 lstm.train
[Thu Feb 15 15:42:30 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0 lstm.train
[Thu Feb 15 15:42:31 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0 lstm.train

Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.lstmf
Loaded 47/47 pages (1-47) of document /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.lstmf
Loaded 45/45 pages (1-45) of document /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.lstmf
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0 lstm.train
[Thu Feb 15 15:42:40 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0 lstm.train

Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.lstmf
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0 lstm.train
[Thu Feb 15 15:42:49 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0 lstm.train

Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.lstmf
Loaded 52/52 pages (1-52) of document /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.lstmf
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0 lstm.train
[Thu Feb 15 15:42:58 IST 2018] /usr/bin/tesseract /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.tif /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0 lstm.train

Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 1
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Page 2
Loaded 47/47 pages (1-47) of document /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.lstmf
Loaded 48/48 pages (1-48) of document /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.lstmf
Loaded 46/46 pages (1-46) of document /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.lstmf
Loaded 51/51 pages (1-51) of document /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.lstmf
Loaded 49/49 pages (1-49) of document /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.lstmf


=== Constructing LSTM training data ===
[Thu Feb 15 15:43:07 IST 2018] /usr/local/bin/combine_lang_model --input_unicharset /tmp/tmp.QzzkE63Ons/eng/eng.unicharset --script_dir /home/adarsh/tesseract/langdata --words /home/adarsh/tesseract/langdata/eng/eng.wordlist --numbers /home/adarsh/tesseract/langdata/eng/eng.numbers --puncs /home/adarsh/tesseract/langdata/eng/eng.punc --output_dir /home/adarsh/tesstutorial/engtrain --lang eng
Failed to read data from: /home/adarsh/tesseract/langdata/eng/eng.wordlist
Failed to read data from: /home/adarsh/tesseract/langdata/eng/eng.punc
Failed to read data from: /home/adarsh/tesseract/langdata/eng/eng.numbers
Loaded unicharset of size 111 from file /tmp/tmp.QzzkE63Ons/eng/eng.unicharset
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Arial.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Arial_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Century_Schoolbook_L_Medium.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Courier_New_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.DejaVu_Sans_Ultra-Light.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Georgia.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Georgia_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Times_New_Roman_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Trebuchet_MS_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.URW_Bookman_L_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Bold_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Verdana.exp0.lstmf to /home/adarsh/tesstutorial/engtrain
Moving /tmp/tmp.QzzkE63Ons/eng/eng.Verdana_Italic.exp0.lstmf to /home/adarsh/tesstutorial/engtrain


Completed training for language 'eng'

---------------------------------------------------------------------------------------------------------

Could anyone possibly tell the reason. I have fixed the Langdata folder now. And also the previous files are different from the file now.
Hope you will reply.

REGARDS
ADARSH SHUKLA
Junior Developer Trainee
TURNING CLOUD SOLUTIONS
+91 9717783099

On Thu, Feb 15, 2018 at 1:34 PM, ShreeDevi Kumar <shree...@gmail.com> wrote:

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/rUcvZ_AhCMI/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

ShreeDevi Kumar

unread,
Feb 15, 2018, 5:36:33 AM2/15/18
to tesser...@googlegroups.com
>  I have fixed the Langdata folder now. And also the previous files are different from the file now.

Look at the error messages. 
Search for 'Failed'

You now have more langdata related errors. 

Adarsh Shukla

unread,
Feb 15, 2018, 6:12:31 AM2/15/18
to tesser...@googlegroups.com
adarsh@adarsh-X555LJ:~/tesseract$ mftraining -F font_properties -U unicharset -O lang.unicharset lang.fontname.exp0.tr lang.fontname.exp1.tr ...
Warning: No shape table file present: shapetable
Failed to load font_properties from font_properties

One more error showing during mf training.
can you tell the reason.

Also for the last problem here is what i searched on tesseract-ocr wiki:

attached below.

REGARDS
ADARSH SHUKLA
Junior Developer Trainee
TURNING CLOUD SOLUTIONS
+91 9717783099

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/rUcvZ_AhCMI/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
shree1.png
Reply all
Reply to author
Forward
0 new messages