Thanks for the link. All the symbols come from one font, in a book
that is a copy of the 1860 text. I am cutting out and pasting 15
copies of each symbol into a large image file.
There are a few quirks.
The uppercase symbols are scaled slightly larger than the lowercase,
should I insert the uppercase symbols into the image file, or is the
OCR able to scale the text? Also, some of the pages scanned are at
slightly different resolutions causing some symbols to be slightly
larger than others. Also some of the lowercase symbols do not
properly reflect the detail of the uppercase (i.e. curls become dots,
90 degree serifs become 45 degree wedges). I am trying to add the
symbols anyway, but some of them are unbelievably rare.
Also for the image file, I have been finding the best and only the
finest symbols for training. Should I also include some poorly
reproduced symbols to reflect what might actually be seen 25% of the
time? For example: a very large number of 'h' will appear as two
symbols because the arch is broken.
Tim Legg