base_checkpoint is not a recognition model, trying training checkpoint...

ameera...@gmail.com

Mar 14, 2019, 3:24:51 PM
to tesseract-ocr

Environment

Tesseract Version:
tesseract 4.0.0-beta.1
leptonica-1.75.3
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0

Found AVX512BW
Found AVX512F
Found AVX2
Found AVX
Found SSE

Platform: uname -a output: Linux CUDA1 4.15.0-29-generic #31~16.04.1-Ubuntu SMP Wed Jul 18 08:54:04 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

cat /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=18.04
DISTRIB_CODENAME=bionic
DISTRIB_DESCRIPTION="Ubuntu 18.04 LTS"


Current Behavior: I am trying to evaluate my model from base_checkpoint, but it fails to load.

lstmeval \
--model ./dotoutout/base_checkpoint \
--traineddata ./dottrain/eng/eng.traineddata \
--eval_listfile ./dottest/eng.training_files.txt

./dotoutout/base_checkpoint is not a recognition model, trying training checkpoint...
Failed to load language model from ./dottrain/eng/eng.traineddata!
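Before digging into the checkpoint itself, it may be worth ruling out a simple path problem. The sketch below is a hypothetical helper (check_inputs is not part of Tesseract) that only confirms each file passed to lstmeval exists and is readable. It says nothing about whether the file is a valid model or traineddata.

```shell
# Hypothetical helper, not a Tesseract tool: verify that every path given
# as an argument exists and is readable before handing it to lstmeval.
check_inputs() {
  for f in "$@"; do
    if [ ! -r "$f" ]; then
      echo "missing or unreadable: $f" >&2
      return 1
    fi
  done
  return 0
}

# Example call with the paths from this post:
#   check_inputs ./dotoutout/base_checkpoint \
#                ./dottrain/eng/eng.traineddata \
#                ./dottest/eng.training_files.txt
```

If this returns non-zero, the failure is at the filesystem level; if it passes, the load error has some other cause.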


Expected Behavior: Model should load from base_checkpoint.


Suggested Fix: The issue seems similar to "Cannot continue from _checkpoint" (#1782). If I have to use a numbered checkpoint, which checkpoint should I use? The one with the latest timestamp in ls -l?
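For reference, picking the numbered checkpoint with the latest timestamp can be scripted rather than read off ls -l by eye. This is a minimal sketch: latest_checkpoint is a hypothetical helper name, and it assumes checkpoint file names end in .checkpoint and contain no spaces or newlines. It only answers "which file was modified last", not whether that checkpoint is the right one to evaluate.

```shell
# Hypothetical helper: print the most recently modified *.checkpoint file
# in the directory given as $1. ls -t sorts by modification time, newest
# first; head keeps only the first entry. Prints nothing if none exist.
latest_checkpoint() {
  ls -t "$1"/*.checkpoint 2>/dev/null | head -n 1
}

# Example call with the output directory from this post:
#   latest_checkpoint ./dotoutout
```

Note that base_checkpoint itself is not matched by the *.checkpoint pattern, so only the numbered checkpoints are considered.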


Unfortunately, evaluating a numbered checkpoint does not work either.

lstmeval \
--model ./dotoutout/base0.715_13171.checkpoint \
--traineddata ./dottrain/eng/eng.traineddata \
--eval_listfile ./dottest/eng.training_files.txt


./dotoutout/base0.715_13171.checkpoint is not a recognition model, trying training checkpoint...
Failed to load model from: ./dotoutout/base0.715_13171.checkpoint



ameera...@gmail.com

Mar 14, 2019, 3:45:39 PM
to tesseract-ocr
I renamed my eng.traineddata to dot.traineddata and stored it in my tessdata folder. This command works:

lstmeval --model ./4.00/tessdata/dot.traineddata \
--eval_listfile ./doteval/eng.training_files.txt

