training/lstmtraining --model_output /.../rus_new/ --continue_from /.../rus.lstm --train_listfile /.../list_of_files.txt --eval_listfile /.../list_of_files.txt --max_iterations 5000
/.../rus.Eskal_Font4You.exp0.tif
/.../rus.Eskal_Font4You.exp0.box
First document cannot be empty!!
num_pages_per_doc_ > 0:Error:Assert failed:in file imagedata.cpp, line 655
you are missing the .lstmf files
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/47875785-3322-4d5d-89fd-1818c2c06bc2%40googlegroups.com.
When using pre-existing box tiff pairs, you have to add a box with tab character to mark end of line and also add boxes with spaces after every word.You then need to generate the .lstmf files - please see training/tesstrain.sh for details.
ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
On Sat, May 6, 2017 at 4:40 PM, bmwmine <bmw...@gmail.com> wrote:
you are missing the .lstmf files
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
Should I add boxes with spaces before punctuation marks?
Also I've found this discussion:It helped me a lot, but I still got questions.What should I put in rus.training_text, if I want to generate .lstmf files from my own box/tiff pairs? Texts from images?
суббота, 6 мая 2017 г., 17:13:29 UTC+5 пользователь shree написал:When using pre-existing box tiff pairs, you have to add a box with tab character to mark end of line and also add boxes with spaces after every word.You then need to generate the .lstmf files - please see training/tesstrain.sh for details.ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.comOn Sat, May 6, 2017 at 4:40 PM, bmwmine <bmw...@gmail.com> wrote:you are missing the .lstmf files--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/47875785-3322-4d5d-89fd-1818c2c06bc2%40googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/35ecde41-d654-408d-bd98-7de37fc6684a%40googlegroups.com.
Thanks in advance.