Hello All. I am new to using Tesseract and have several questions I need to ask, so please be patient with me.
I want to incorporate Tesseract into a C#.NET application with straight output to a text file. I am using Visual Studio 2010.
- Should I use the Tesseract API or just tesseract-ocr and leptonica library?
- Can Tesseract \ API be compiled using Visual Studio 2010?
- Are there any special steps in compiling the API that I should know about?
My development platform is 64-bit Windows 8.1.
- Can my Visual Studio 2010 C# application be set to use Any CPU, i.e. 32-bit and 64-bit, or does it have to specifically target 64-bit user platforms?
When testing a scan containing an official government seal (see attached Word docx) and attempting to OCR the form using freeOCR, the government seal is converted to gibberish, i.e. symbols and stray characters.
- Is the seal considered an image?
- How can I instruct Tesseract to ignore this seal?
Thank you