need best practice to OCR on documents with + sign that helps align the documents.
Any known practice?
See attached file that I'm trying to OCR and get perfect results.
Currently, I'm cropping at the top with 18 percent height of the document... and if needed remove the border using accusoft scanfix.
Then OCR just that, so I get some blank lines and then + + then numbers...
My problem right now is that when all chars are used, 1 becomes i because of that speck in this document. (I can de-speckle if there is no other way to improve)
If I use only digits.. only 0-9, then I get a weird result, I get an extra 5 just below the speck.
Is there simple a way to find this line and use a constant height to OCR this line? so the speck will not be in that rectangle?
Is there a way to get the positions of those + signs in pixels?
Please advise.
Thank you