Hi,
I'm trying to OCR short strings of the form: one letter, a space, then four digits.
I'm doing a lot of pre-processing to clean up the picture, and so far I end up with something like this:
My problem is that tesseract only detects the digits. I tried all the possible PSM values with the same result: the leading C is always ignored.
This is the command line that I am running:
tesseract -c tessedit_char_whitelist=" 0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ" output6.png stdout
I tried with tesseract 5.3.0 and tesseract 5.3.4-45-g87a15 with the same result.
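In case it helps anyone reproduce this, here is a rough sketch of how the PSM sweep could be scripted. It is not the exact script I ran. The helper names (build_command, matches_expected, sweep) are made up for illustration, the range of PSM values (3 to 13) is an assumption, and the pattern simply encodes the "letter, space, four digits" format described above.

```python
import re
import subprocess

# Same whitelist as in the command line above.
WHITELIST = " 0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ"

# Expected format: one uppercase letter, one space, four digits.
EXPECTED = re.compile(r"[A-Z] [0-9]{4}")

# Assumption: sweep PSM 3..13 and skip the OSD-related modes.
PSM_MODES = range(3, 14)


def build_command(image, psm):
    """Build the tesseract argument list for one PSM value (hypothetical helper)."""
    return [
        "tesseract", image, "stdout",
        "--psm", str(psm),
        "-c", f"tessedit_char_whitelist={WHITELIST}",
    ]


def matches_expected(text):
    """Return True if the OCR text is exactly 'letter, space, four digits'."""
    return EXPECTED.fullmatch(text.strip()) is not None


def sweep(image):
    """Run tesseract once per PSM value and collect the stripped output."""
    results = {}
    for psm in PSM_MODES:
        proc = subprocess.run(
            build_command(image, psm), capture_output=True, text=True
        )
        results[psm] = proc.stdout.strip()
    return results


if __name__ == "__main__":
    for psm, text in sweep("output6.png").items():
        status = "OK" if matches_expected(text) else "no match"
        print(psm, repr(text), status)
```

Checking the output against the expected pattern makes it easy to see at a glance which modes, if any, return the letter as well as the digits.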
I'm looking for recommendations on what I can do to help tesseract detect the leading C correctly.
Thanks,
JMS