Occasionally I need an OCR tool to convert a *.png image to text.
It is important to note that the source is a *.png screenshot taken from a webpage.
So this is NOT a fuzzy, blurred scan from a newspaper or magazine, and not text detection in a photo.
It's just a very simple, clear source which needs to be converted to text.
I considered Tesseract, or a GUI built on top of Tesseract, for this job.
I used OCRget as the GUI.
Surprisingly, the output is extremely bad.
Have a look at the snapshots of the source and the output below.
Hardly any brackets are recognized.
Many characters are misrecognized.
No indentation is preserved.
Is this task really too difficult?
I cannot believe this.
Does anyone have suggestions to improve recognition quality?

