I have a large number of images that each contain a single line of alphanumeric data. My results with Tesseract so far have not been great, and I could use some assistance.
Several variables are turned off, as recommended in the docs:
key.push_back("load_system_dawg");
val.push_back("false");
key.push_back("load_freq_dawg");
val.push_back("false");
These are set at initialization:
tess->Init(nullptr, "eng", tesseract::OEM_DEFAULT, nullptr, 0, &key, &val, false);
tess->SetPageSegMode(tesseract::PageSegMode::PSM_SINGLE_LINE);
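For reference, here is a minimal sketch of how the two parallel vectors could be built so that each name stays paired with its value. It assumes Tesseract 5.x, where Init() takes std::vector<std::string> pointers for the variable names and values (4.x uses different container types). InitVars and MakeInitVars are hypothetical helper names, not part of the Tesseract API.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper (not part of Tesseract): holds the two parallel
// vectors passed to Init() for init-time variables.
struct InitVars {
    std::vector<std::string> key;
    std::vector<std::string> val;
};

// Build the parallel vectors from name/value pairs so that key[i]
// always corresponds to val[i].
inline InitVars MakeInitVars(
    const std::vector<std::pair<std::string, std::string>>& pairs) {
    InitVars vars;
    for (const auto& p : pairs) {
        vars.key.push_back(p.first);
        vars.val.push_back(p.second);
    }
    // Sanity check: both vectors must have the same length.
    assert(vars.key.size() == vars.val.size());
    return vars;
}

// Usage sketch (assumes a tesseract::TessBaseAPI* named tess, as above):
//   InitVars vars = MakeInitVars({{"load_system_dawg", "false"},
//                                 {"load_freq_dawg", "false"}});
//   tess->Init(nullptr, "eng", tesseract::OEM_DEFAULT, nullptr, 0,
//              &vars.key, &vars.val, false);
```

This only restructures the setup shown above; it does not change which variables are passed to Init().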
Some images are close, such as this one:
...which is interpreted as "SZ2EC 3".
Others, like this one, return a blank string:
Then there are some, like this one, that are very close, but Tesseract removes the spaces between the characters, so this example results in "1201":
Thanks,
Stéphane