Error during training

71 views
Skip to first unread message

Alberto Ramirez

unread,
Mar 19, 2022, 2:59:31 AM3/19/22
to tesseract-ocr
this is the commands i use
tesseract eng.font1.exp0.tif train.my.exp0 batch.nochop makebox
tesseract eng.font1.exp0.tif train.my.exp0 box.train
unicharset_extractor train.my.exp0.box
echo "temp 0 0 1 0 0" > font_properties
mftraining -F font_properties -U unicharset -O eng.unicharset train.my.exp0.tr

This is the data from yt tutorial, i follow it 1:1, and as always, the dude from the movie get it done, and in my case i get a crash and errors from the picture.
img1.png

Zdenko Podobny

unread,
Mar 19, 2022, 6:32:20 AM3/19/22
to tesser...@googlegroups.com

so 19. 3. 2022 o 7:59 Alberto Ramirez <albert...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/2c757396-cb9f-4ca4-b6c0-8c66cdcb3ea8n%40googlegroups.com.

Alberto Ramirez

unread,
Mar 19, 2022, 10:17:08 AM3/19/22
to tesseract-ocr
This doesn't explain why that thing  dosn't work, the first 4 steps work fine. I already tryied to use the first link, but the guide is too chaotic.. There are some steps missing, the errors messages which i got are missing as well. The only thing i understood is that the method i am trying to use is supposed to be old, but still idk what commands i should put in lstmtraining.

Zdenko Podobny

unread,
Mar 19, 2022, 12:30:36 PM3/19/22
to tesser...@googlegroups.com
Follow the latest official training instructions. Otherwise, nobody will help you.
What you show seems like faking (e.g. not really following and understanding) the training process for tesseract 3.x version.

Zdenko


so 19. 3. 2022 o 15:17 Alberto Ramirez <albert...@gmail.com> napísal(a):
This doesn't explain why that thing  dosn't work, the first 4 steps work fine. I already tryied to use the first link, but the guide is too chaotic.. There are some steps missing, the errors messages which i got are missing as well. The only thing i understood is that the method i am trying to use is supposed to be old, but still idk what commands i should put in lstmtraining.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.

Alberto Ramirez

unread,
Mar 26, 2022, 12:28:54 PM3/26/22
to tesseract-ocr
This documentation sucks, the steps are not even complete, i am trying to understand how to make that unicharset, and ther's not even one example



Alberto Ramirez

unread,
Mar 26, 2022, 1:04:39 PM3/26/22
to tesseract-ocr
tesseract custom_font/train.my.exp0.tif custom_font/train.my.exp0 batch.nochop makebox
tesseract custom_font/train.my.exp0.tif custom_font/train.my.exp0 --psm 6 lstm.train
echo "temp 0 0 1 0 0" > custom_font/font_properties
unicharset_extractor custom_font/train.my.exp0.box

That's all i was able to "guess" by myself, what to do then

Alberto Ramirez

unread,
Mar 26, 2022, 4:23:29 PM3/26/22
to tesseract-ocr
If somebody ever has same issues, just download tesseract 4.0, finnally there are some good tutorials and guides which do work, for example: https://www.youtube.com/watch?v=1v8BPw0Dn0I, there's no point to waste the time on the 5.0 "documentation", it's too twisted for begginers, and if something doesn't work here's no one to help.

Alberto Ramirez

unread,
Mar 26, 2022, 4:27:16 PM3/26/22
to tesseract-ocr
Also all the softwares i was able to find to make training faster, don't work with ver 5
Reply all
Reply to author
Forward
0 new messages