The good news is that the training image generation program has
recently been added to the code repository[0] and works with regular
Linux distributions, as well as most[1] of the information needed to
recreate the training tif/box files[2]. If you can get that working,
you can just add your own training tif/box files alongside it.
I plan to update the TrainingTesseract3 wiki page soon to make this
clearer, but haven't done so yet.
APPLY_BOXES: boxfile line 5364/748 ((1488,893),(1532,6)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 5365/1285 ((1494,1418),(1532,6)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 5366/1552 ((1495,1626),(1529,6)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 5367/1708 ((1494,1784),(1531,6)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 5368/1970 ((1484,2101),(1532,6)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 5369/2493 ((1494,2625),(1532,6)): FAILURE! Couldn't find a matching blob
training/text2image --text=trainingText.txt --outputbase=eng.courier.exp0 --font='Courier New' --fonts_dir=/Library/Fonts/ --ptsize=14 --char_spacing=2.5 --degrade_image=0