usr/share/tesseract-ocr/./tesstrain.sh \
--fonts_dir /usr/share/fonts \
--lang ben \
--linedata_only\
--noextract_font_properties \
--langdata_dir /home/jennil/Desktop/pro/langdata-master/ben\
--tessdata_dir /usr/share/tesseract-ocr/4.00/tessdata –output_dir /home/jennil/Desktop/pro/output/ben_output\
--fontlist “Lohit Bengali”
And here is the error:
ERROR: Unrecognized argument --linedata_only--noextract_font_properties
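The error message shows the two options fused into a single word. In a POSIX shell, a backslash immediately followed by a newline is removed as a line continuation, so when there is no space before the backslash, the first word of the next line is appended directly to the last word of the current one. A minimal sketch of that behavior (count_args is a hypothetical helper used only for illustration, not part of tesstrain.sh):

```shell
#!/bin/sh
# count_args is a hypothetical helper: it prints how many arguments it received.
count_args() { echo "$#"; }

# No space before the backslash: the two options fuse into one word.
fused=$(count_args --linedata_only\
--noextract_font_properties)

# A space before the backslash keeps them as two separate words.
separate=$(count_args --linedata_only \
--noextract_font_properties)

echo "fused: $fused, separate: $separate"
```

If this is what happened here, putting a space before every trailing backslash in the command should keep each option separate.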
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3aef2479-d04f-4b80-8d3b-abec3d4a9468%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
/usr/share/tesseract-ocr/./tesstrain.sh \
--fonts_dir /usr/share/fonts \
--lang ben \
--linedata_only \
--noextract_font_properties \
--langdata_dir /home/jennil/Desktop/pro/langdata-master/ben \
--tessdata_dir /usr/share/tesseract-ocr/4.00/tessdata \
-–output_dir /home/jennil/Desktop/pro/output/ben_output \
--fontlist “Lohit Bengali”
Please help.
--output_dir needs two dashes (two plain ASCII hyphens).
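In the second command above, the output_dir option starts with a hyphen followed by an en dash (–), which looks like a dash but is not an ASCII hyphen. Such characters often come from copying a command out of a word processor or web page. One way to spot them before running a command is sketched below. It assumes a grep that treats each byte as a character under LC_ALL=C (as GNU grep does); has_non_ascii is a hypothetical helper and /tmp/out is a placeholder path.

```shell
#!/bin/sh
# has_non_ascii is a hypothetical helper: it succeeds if the text contains
# any byte outside printable ASCII (for example an en dash or typographic quotes).
has_non_ascii() {
    printf '%s\n' "$1" | LC_ALL=C grep -q '[^ -~]'
}

bad='-–output_dir /tmp/out'     # hyphen followed by an en dash
good='--output_dir /tmp/out'    # two ASCII hyphens

has_non_ascii "$bad"  && echo "bad: contains non-ASCII characters"
has_non_ascii "$good" || echo "good: plain ASCII only"
```

Retyping the flagged option by hand in the terminal is usually enough to replace the character.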
----
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
I tried using Lohit Bengali and here is the command
/usr/share/tesseract-ocr/./tesstrain.sh --fonts_dir /usr/share/fonts --lang ben --linedata_only --noextract_font_properties --langdata_dir /home/jennil/Desktop/pro/langdata-master --tessdata_dir /usr/share/tesseract-ocr/4.00/tessdata --output_dir /home/jennil/Desktop/pro/output/ben_output --fontlist “Lohit Bengali”
and the error I got is:
== Starting training for language 'ben'
[Mon Jul 23 01:18:01 EDT 2018] /usr/bin/text2image --fonts_dir=/usr/share/fonts --font=“Lohit --outputbase=/tmp/font_tmp.zAepRNq6Yo/sample_text.txt --text=/tmp/font_tmp.zAepRNq6Yo/sample_text.txt --fontconfig_tmpdir=/tmp/font_tmp.zAepRNq6Yo
Could not find font named “Lohit.
Pango suggested font FreeMono.
Please correct --font arg.
=== Phase I: Generating training images ===
Rendering using “Lohit
Rendering using Bengali”
[Mon Jul 23 01:18:16 EDT 2018] /usr/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zAepRNq6Yo --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.abQfzSYB19/ben/ben.Bengali”.exp0 --max_pages=3 --font=Bengali” --text=/home/jennil/Desktop/pro/langdata-master/ben/ben.training_text
[Mon Jul 23 01:18:16 EDT 2018] /usr/bin/text2image --fontconfig_tmpdir=/tmp/font_tmp.zAepRNq6Yo --fonts_dir=/usr/share/fonts --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 --outputbase=/tmp/tmp.abQfzSYB19/ben/ben.“Lohit.exp0 --max_pages=3 --font=“Lohit --text=/home/jennil/Desktop/pro/langdata-master/ben/ben.training_text
Could not find font named Bengali”.
Pango suggested font FreeMono.
Please correct --font arg.
Could not find font named “Lohit.
Pango suggested font FreeMono.
Please correct --font arg.
ERROR: /tmp/tmp.abQfzSYB19/ben/ben.Bengali”.exp0.box does not exist or is not readable
ERROR: /tmp/tmp.abQfzSYB19/ben/ben.“Lohit.exp0.box does not exist or is not readable
ERROR: /tmp/tmp.abQfzSYB19/ben/ben.“Lohit.exp0.box does not exist or is not readable
Please help me out, shreeshrii.
I read the link, but I am still confused about the fonts. The Lohit Bengali font is already installed on the system, so why is this happening?
Some of the fonts listed when I ran text2image --fonts_dir /usr/share/fonts --list_available_fonts are:
01: Liberation Serif Italic
102: Likhan Medium
103: Lohit Assamese
104: Lohit Bengali
105: Lohit Devanagari
106: Lohit Gujarati
107: Lohit Gurmukhi
108: Lohit Kannada
109: Lohit Malayalam
110: Lohit Odia
111: Lohit Tamil
112: Lohit Tamil Classical
113: Lohit Telugu
114: Loma
115: Loma Bold
116: Loma Bold Oblique
117: Loma Oblique
118: Manjari
119: Manjari Bold
120: Manjari Thin
121: Meera
122: Mitra Mono
...
Lohit Bengali is in the list, so please tell me why the error occurs. Do I need to do something else as well?
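The log above shows text2image being called once with --font=“Lohit and once with --font=Bengali”, which suggests the font name was split at the space before it reached the script. A shell does exactly that when the name is wrapped in typographic quotes (“ ”) instead of ASCII quotes, because only " and ' act as quoting characters. A minimal sketch of the difference (count_args is a hypothetical helper used only for illustration):

```shell
#!/bin/sh
# count_args is a hypothetical helper: it prints how many arguments it received.
count_args() { echo "$#"; }

# Typographic quotes are ordinary characters to the shell, so the space
# between the two words still splits the font name into two arguments.
curly=$(count_args --fontlist “Lohit Bengali”)

# ASCII double quotes keep the font name together as one argument.
straight=$(count_args --fontlist "Lohit Bengali")

echo "curly: $curly, straight: $straight"
```

If that is the cause here, retyping the option by hand as --fontlist "Lohit Bengali" with straight quotes may be worth trying, since the font itself does appear in the list above.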