lstm eval gives error 0% but tesseract fails to predict correctly

69 views
Skip to first unread message

Artur Maricato Curinga

unread,
Jul 15, 2019, 3:00:37 PM7/15/19
to tesseract-ocr
I'm trying to generate a model with OCR-B font to read documents MRZ, tried an overfitting test using 44 cropped characters and trained using the ocrd-training repo (https://github.com/OCR-D/ocrd-train) using train = eval img list or without the eval on training.

The training stops since lstmeval gives a 0% error rate, but testing the model with tesseract fails to predict the correct text on 15% of the images.
What is the difference between the lstmeval and the tesseract prediction? This behaviour is expected?
overfit.zip

Steven ZHOU

unread,
Mar 1, 2020, 2:20:02 AM3/1/20
to tesseract-ocr
Hi, I met the same problem. Have you found the reason or solutions? Thanks

在 2019年7月16日星期二 UTC+8上午3:00:37,Artur Maricato Curinga写道:
Reply all
Reply to author
Forward
0 new messages