I use tesseract-ocr-w64-setup-v4.0.0.20181030 and jTessBoxEditor-2.2.0 in windows10. I use 3 images for test,you can find it in the attach files sample.zip.
1. I use jTessBoxEditor to merge the 3 images.
The merged file name is "langyp.fontyp.exp0.tif"
2. generate box file
tesseract langyp.fontyp.exp0.tif langyp.fontyp.exp0 -l eng --psm 7 --oem 3 batch.nochop makebox
Then generate langyp.fontyp.exp0.box file
3.Open JTessBoxEditor -> Box Editor --> open langyp.fontyp.exp0.tif --> modify mistakes
4. generate font_properties
echo "fontyp 0 0 0 0 0" > font_properties
5. generate training file
tesseract langyp.fontyp.exp0.tif langyp.fontyp.exp0 -l eng --psm 7 --oem 3 nobatch box.train
6. generate charset file
unicharset_extractor langyp.fontyp.exp0.box
Then generate unicharset file
7. generate shape file
10.rename normproto fontyp.normproto
rename inttemp fontyp.inttemp
rename pffmtable fontyp.pffmtable
rename unicharset fontyp.unicharset
rename shapetable fontyp.shapetable
11.combine_tessdata fontyp.
12.Then you can get the fontyp.traineddata file
But when I follow these steps at step 7,after typing "shapeclustering -F font_properties -U unicharset -O langyp.unicharset langyp.fontyp.exp0.tr" this command, the teriminal does not have any output even though wating for more than 20 minutes.
If I skip the step 7 do step8, after typing "mftraining -F font_properties -U unicharset -O langyp.unicharset langyp.fontyp.exp0.tr" this command, only one warning "No shape table file present: shapetable"
Then, the teriminal does not have any output even though waiting for long time.