Hi all,
I have problems when using tesseract to recognize text from the uploaded image.
I used tesseract executable v4.0.0 released on 30. Oct. 2018 with best tessdata deu.traineddata and eng.traineddata. deu.traineddata was fine tuned with the font used in the image. I also used pytesseract as wrapper.
result = pytesseract.image_to_data(img, lang = 'deu+eng', config = '--psm 11')
I tried "lang = deu, psm = 10" as well. But none of them gave me satisfied results. 'y', 'a' and 'e' are always missed and 'ü', 'ö' and 'ä' are recognized as 'U', 'O' and 'a'.
Please help me out. Can you get good results?
Thank you and best regards,
Sa