Incorporating Tesseract in C# Application

75 views
Skip to first unread message

Richard Arnold

unread,
Jun 29, 2014, 2:53:25 PM6/29/14
to tesser...@googlegroups.com
Hello All. I am new to using Tesseract and have several questions I need to ask, so please be patient with me.

I want to incorporate Tesseract into a C#.NET application with straight output to a text file. I am using Visual Studio 2010.

  1. Should I use the Tesseract API or just tesseract-ocr and leptonica library?
  2. Can Tesseract \ API be compiled using Visual Studio 2010?
  3. Are there any special steps in compiling the API that I should know about?
My development platform is 64-bit Windows 8.1.
  1. Can my Visual Studio 2010 C# application be set to use Any CPU, i.e. 32-bit and 64-bit, or does it have to specifically target 64-bit user platforms?
When testing a scan containing an official government seal (see attached Word docx) and attempting to OCR the form using freeOCR, the government seal is converted to gibberish, i.e. symbols and stray characters.
  1. Is the seal considered an image?
  2. How can I instruct Tesseract to ignore this seal?
Thank you


Local government seal.docx

Paul

unread,
Jul 2, 2014, 5:55:22 AM7/2/14
to tesser...@googlegroups.com
There's a .NET wrapper around Tesseract on GitHub: https://github.com/charlesw/tesseract
It's using Visual Studio 2012 though. I don't know if that might cause problems.

I guess you can use the 32 bit version of Tesseract on both 32 and 64 bit Windows.
Reply all
Reply to author
Forward
0 new messages