Windows Setup: tesseract-ocr-setup-4.0.0-alpha.20170804.exe
Spanish Trained Data: https://github.com/tesseract-ocr/tessdata/raw/4.00/spa.traineddata
Command Used to OCR:
tesseract.exe ImageDoc.png output --oem 1 -l spa
Where ImageDoc.png is a Spanish Scanned Document
output is the text file output of OCRed text
In Spanish, character ‘o’ is recognized incorrectly as some round symbol. Attached input file is ImageDoc.png and Error screenshot
Character ‘o’ should be recognized correctly.