Hi all,
I'm using Tesseract for number plate recognition (via OpenALPR), which passes single characters to Tesseract for recognition.
I found that recognition accuracy is very sensitive to holes in a character.
If the character in the binary image has one or more small holes in it, then it is likely to get a wrong result.
For example, this "0" is falsely recognized as "Z":

But with just a single pixel changed, which opens the hole in its upper part, it is correctly recognized as "0":
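
To make the term concrete: by a "hole" I mean a background region that is fully enclosed by foreground pixels. Below is a minimal Python sketch of that idea. It is not part of OpenALPR or Tesseract; the `count_holes` helper and the toy 7x7 glyph are made up for illustration. It counts enclosed background regions and shows how a single pixel changes the count.

```python
from collections import deque


def count_holes(glyph):
    """Count background regions ('.') fully enclosed by foreground ('#').

    Regions are 4-connected. A background region that touches the image
    border is treated as outside background, not as a hole.
    """
    h, w = len(glyph), len(glyph[0])
    seen = [[False] * w for _ in range(h)]

    def touches_border(r0, c0):
        # Flood-fill the background region containing (r0, c0) and
        # report whether any of its pixels lies on the image border.
        touches = False
        queue = deque([(r0, c0)])
        seen[r0][c0] = True
        while queue:
            r, c = queue.popleft()
            if r in (0, h - 1) or c in (0, w - 1):
                touches = True
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                nr, nc = r + dr, c + dc
                if (0 <= nr < h and 0 <= nc < w
                        and not seen[nr][nc] and glyph[nr][nc] == "."):
                    seen[nr][nc] = True
                    queue.append((nr, nc))
        return touches

    holes = 0
    for r in range(h):
        for c in range(w):
            if glyph[r][c] == "." and not seen[r][c]:
                if not touches_border(r, c):
                    holes += 1
    return holes


# Toy "0": thick strokes, the real counter in the middle, plus one
# noise pixel missing inside the top stroke (hypothetical data).
noisy = [
    "#######",
    "###.###",
    "#######",
    "##...##",
    "##...##",
    "#######",
    "#######",
]

# Same glyph with one more pixel removed, which opens the noise hole
# to the outside.
opened = ["###.###"] + noisy[1:]

print(count_holes(noisy))   # 2: the counter plus the noise hole
print(count_holes(opened))  # 1: only the counter remains enclosed
```

The two glyphs differ by exactly one pixel, yet their hole counts differ, which matches the kind of change I see between the two images.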

Some more examples are attached.
I cannot predict where the holes are going to be, because they are caused by noise in the image, so I don't think they should be added to the training samples.
Is there a way to fix this and make recognition robust to small noise?
Thank you.