How can I improve the OCR results for the form above? I have tried several preprocessing methods, such as erasing the ruling lines and cropping out individual rows, but none of them seems to improve Tesseract's accuracy.
The biggest problem is that while the text I need (the field value) is usually recognized reasonably well, the surrounding identifier (the field label) is sometimes recognized poorly, which makes it hard to extract the value with a regex.
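
To make the extraction problem concrete, here is a minimal sketch of the kind of label-tolerant matching I would want in place of a strict regex. It is only an illustration under assumptions: the label names in `KNOWN_LABELS` are hypothetical (they are not the identifiers from my form), the OCR output is assumed to contain one `Label: value` pair per line, and the `cutoff` of 0.7 is an arbitrary starting point. It uses Python's standard `difflib` rather than anything specific to Tesseract.

```python
import difflib

# Hypothetical field labels for illustration; not the real form's identifiers.
KNOWN_LABELS = ["Invoice Number", "Customer Name", "Date of Birth"]


def split_label_value(line, sep=":"):
    """Split an OCR'd line into (label, value) at the first separator.

    Returns None when the line has no separator at all.
    """
    label, found, value = line.partition(sep)
    if not found:
        return None
    return label.strip(), value.strip()


def match_label(noisy_label, known_labels=KNOWN_LABELS, cutoff=0.7):
    """Return the known label most similar to noisy_label, or None."""
    # Compare case-insensitively, but return the canonical spelling.
    lowered = {label.lower(): label for label in known_labels}
    hits = difflib.get_close_matches(
        noisy_label.lower(), list(lowered), n=1, cutoff=cutoff
    )
    return lowered[hits[0]] if hits else None


def extract_fields(ocr_text):
    """Map each recognizable label in the OCR text to its field value."""
    fields = {}
    for line in ocr_text.splitlines():
        parts = split_label_value(line)
        if parts is None:
            continue  # no "label: value" structure on this line
        label = match_label(parts[0])
        if label is not None:
            fields[label] = parts[1]
    return fields
```

For example, `extract_fields("lnvoice Numbcr: 12345")` would still map the value to `"Invoice Number"` even though two characters of the label were misread. This only works around a noisy identifier after the fact; it does not improve the OCR itself, which is what I am really asking about.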