I agree that Abbyy will do the job more accurate out of the box and is easier to get started with.
You may also want to have a look at this article: http://www.splitbrain.org/blog/2010-06/15-linux_ocr_software_comparison
On Wednesday, May 23, 2012 9:03:31 PM UTC+4, Scott Oom wrote:We are working on automated testing tools for applications and games.
We want to be able to verify various text in the UIs in different
languages and have been experimenting with Tesseract OCR and having a
lot of fun with it.
In 2007, Ray Smith mentioned that "Tesseract is now behind the leading
commercial engines in terms of its accuracy."
What commercial engines are more accurate than Tesseract and in what
ways? Can Tesseract OCR approach the commercial engines with training
and adjusting of parameters or is it still behind?