The easiest way to train MICR CMC-7 font for Tesseract would be using OCR-D (
https://github.com/OCR-D/ocrd-train). This is what we've used in our R&D project (
https://github.com/DoubangoTelecom/tesseractMICR). We open sourced the MICR E-13B traineddata but not the CMC-7. We're not using these models in our products but the result is more accurate than any commercial product you can find online (
LEADTOLS, accusoft, recogniform and abbyy). You'll also need heavy pre-processing to fill the interspaces. If you're familiar with Tensorflow then, I'd recommend using it instead of Tesseract.
On Thursday, April 2, 2020 at 8:22:44 PM UTC+2, Ghada Aruri wrote:
Hi team,
For CMC-7, I want to train it by using jTessBoxEditor to get cmc7.traineddata what the steps to get the cmc7.traineddata?
and if anybody has done it and is willing to share me if you can?
Best Regards.