I'm trying to fine-tune Tesseract to recognize digits only, but I'm not getting good results so far. I continued training from the Arabic language model "ara", since the digits I'm trying to recognize are Arabic numbers.
Training stops early once the error rate reaches 0.01, but the results on the test data are really bad.
I'm using my own box/tif files and my own training text with tesstrain.sh.
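To make "really bad" measurable and comparable with the 0.01 training error, the test results could be scored with a character error rate (CER). The following is only a minimal sketch, not part of Tesseract: the function names `edit_distance` and `character_error_rate` are hypothetical, and the sample strings are made up for illustration. It assumes the ground-truth text and the OCR output for each test image are already available as plain strings.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum inserts, deletes and substitutions."""
    prev = list(range(len(hyp) + 1))  # distances from the empty prefix of ref
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a char from ref
                            curr[j - 1] + 1,      # insert a char into ref
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(pairs):
    """CER over (ground_truth, ocr_output) pairs: total edits / total reference chars."""
    total_edits = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total_chars = sum(len(ref) for ref, _ in pairs)
    return total_edits / total_chars if total_chars else 0.0


if __name__ == "__main__":
    # Made-up sample data: (ground truth, OCR output) for two test lines.
    samples = [("1234", "1234"), ("456", "459")]
    print(f"CER on test data: {character_error_rate(samples):.3f}")
```

A CER on held-out data that is far above the training error would point to overfitting or to a mismatch between the training images and the test images.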
Any recommendations on what I should do to get better results?