training/text2image --text=npn_training_text.txt --outputbase=npn.Exo.exp0 --font='Exo' --fonts_dir=/usr/share/fontstraining/text2image --text=npn_training_text.txt --outputbase=npn --fonts_dir=/usr/share/fonts --find_fonts --min_coverage=1.0 --render_per_font=true--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/9d7df5ab-e1ad-43a6-9d7b-d7ba4ef39951%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Please see tesstrain.sh script file in training directory.
It automates the whole training process.
On 31-Aug-2017 9:29 PM, "Dan9er" <dan9ert...@gmail.com> wrote:
Running--training/text2image --text=npn_training_text.txt --outputbase=npn.Exo.exp0 --font='Exo' --fonts_dir=/usr/share/fonts
gives the desired output of two files:But running this command for the 162 fonts I want to use is very time consuming and monotonous. I tried running this command:
- npn.Exo.exo0.tif
- npn.Exo.exp0.box
training/text2image --text=npn_training_text.txt --outputbase=npn --fonts_dir=/usr/share/fonts --find_fonts --min_coverage=1.0 --render_per_font=trueBut that only made files in this format: npn.{fontName}.tifHow do I automate making .tif AND .box files? Do I have to change the --outputbase to something different or do I have to make a .sh script?PS. I did run training/text2image --find_fonts with --render_per_font set to false, so I have a npn.fontlist.txt file on hand.
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/59709cd6-d819-4b3a-8988-405ab40e7e25%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a8d711b0-15b2-43c0-9b36-6e92a0d1bc3b%40googlegroups.com.
Well you can face problems if you install several version of tesseract event to different location like /usr, usr/local, /opt (or you have to very careful and you have to be familiar with your system, linking to shared libs etc.).
So I would suggest you to install only one version of tesseract (and uninstall former version before installing new version).
If you want to have several version of tesseract (e.g. you want to compare OCR result) I would suggest you to compile them from source (e.g. in /usr/src) and not installed them. If you want to test particular version you can run it this way:
/usr/src/tesseract-3.03/api/tesseract eurotext.tif eurotext
/usr/src/tesseract-ocr.3.02/api/tesseract eurotext.tif eurotext
/usr/src/tesseract-3.03/api/tesseract is shell wrapper script, and it will take care that correct shared library is used (without installation...).
Zdenko
training-uninstall: @cd "$(top_builddir)/training" && $(MAKE) uninstall--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3a2fccf4-26d2-4dbb-b716-9f271d487995%40googlegroups.com.