I am facing challenges with the accuracy of the OCR, and was hoping that someone could guide me through the process of debugging the problem so that I can apply these techniques to other OCR related issues that I face. Attached is a snippet of a document that is not correctly OCR'd. The output that I get is:
RE U'EST FO DICAL
The following config entries were added to configs/use-userdict
load_system_dawg F
load_freq_dawg F
load_punc_dawg F
load_number_dawg F
load_unambig_dawg F
load_bigram_dawg F
load_fixed_length_dawgs F
user_words_suffix user-words
tessedit_write_images T
tessedit_dump_pageseg_images T
and eng.user-words has the following entries
REQUEST
FOR
INDEPENDENT
MEDICAL
REVIEW
The following command line was used
tesseract test.png stdout -l eng use-userdict