Hello,
I'm trying to fine-tune the eng.traineddata model by following the steps at this link:
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-%C2%B1-a-few-characters
As the tutorial shows, I am fine-tuning for ± a few characters, following those steps.
But when I run the first command, which generates new training and eval data:
training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng --linedata_only \
--noextract_font_properties --langdata_dir ../langdata \
--tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus
I get the following error:
Creation of encoded unicharset failed! While constructing LSTM training data.
Please see the attached image for more details.
Can you help me? Thanks.