
Re: Success story using tesseract


zdenko podobny

Feb 4, 2013, 3:45:16 PM
to tesser...@googlegroups.com
Have a look at the wiki [1]; I have tried to keep a list of known implementations of tesseract-ocr there.
There is also an interesting paper from the Poznań Supercomputing and Networking Center [2], in which they document their experience with tesseract and its training.

[2] http://lib.psnc.pl/dlibra/docmetadata?id=358&from=publication&showContent=true

Zdenko


On Fri, Feb 1, 2013 at 5:34 PM, Jakub Jaroš <jaros...@gmail.com> wrote:
Hello,

in our project, we are trying to decide whether or not to use Tesseract. I would like to hear from anybody who is successfully using Tesseract (or a combination with Ocropus) in any project. What kind of documents are you using it on? Is your project successful? Do you think your client sees the project as successful?

Thanks to anybody who is willing to share some info.

Jakub

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
 

TP

Feb 5, 2013, 5:22:37 PM
to tesser...@googlegroups.com
On Fri, Feb 1, 2013 at 8:34 AM, Jakub Jaroš <jaros...@gmail.com> wrote:
> in our project, we would like to decide about using Tesseract for it or not.
> I would like to ask somebody who is successfully using Tesseract (or Ocropus
> combination) in any project.

I was recently playing around with Wolfram Research's Mathematica [1].
While trying to figure out if it was possible to see the source for
some of their new Image Processing functions (sadly no I think, since
everything in Mathematica\9.0\SystemFiles\Kernel\SystemResources\Windows\ImageProcessing\
seems to be .mx rather than .m files), I stumbled upon the following:

C:\Program Files\Wolfram Research\Mathematica\9.0\SystemFiles\Converters\Binaries\Windows-x86-64\tesseract.exe

C:\Program Files\Wolfram Research\Mathematica\9.0\SystemFiles\Libraries\Windows\leptonlib.dll

and

C:\Program Files\Wolfram Research\Mathematica\9.0\SystemFiles\Converters\Tesseract\tessdata\

(which contains 7 traineddata files)
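
As an aside on that tessdata directory: Tesseract language data files are named <lang>.traineddata, so listing the directory shows which languages an installation can recognize. Below is a minimal Python sketch of that check. The function name list_tessdata_languages is hypothetical (it is not part of Tesseract or Mathematica), and the directory you pass in is whatever your own installation uses.

```python
from pathlib import Path


def list_tessdata_languages(tessdata_dir):
    """Return the sorted language codes found in a tessdata directory.

    Assumes the usual naming convention <lang>.traineddata,
    e.g. eng.traineddata -> "eng". Other files are ignored.
    """
    return sorted(p.stem for p in Path(tessdata_dir).glob("*.traineddata"))
```

Calling list_tessdata_languages on the tessdata path above should return one entry per traineddata file found there.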

Googling, I see that the TextRecognize function [2] is indeed
implemented using tesseract [3][4][5].

Off-topic, but fascinating for its use of various Mathematica and OpenCV image
processing operations: see [6]. (One day I should try my own solution
using just leptonica.)

[1] http://www.wolfram.com/mathematica/

[2] http://reference.wolfram.com/mathematica/ref/TextRecognize.html

[3] http://mathematica.stackexchange.com/questions/18683/applying-textrecognize-on-alpha-numerical-table

[4] http://stackoverflow.com/questions/8916732/mathematicas-textrecognize-not-up-to-par/8919253#8919253

"Implementing OCR without using TextRecognize"
[5] http://mathematica.stackexchange.com/questions/13097/implementing-ocr-without-using-textrecognize

"How to remove convexity defects in sudoku square"
[6] http://stackoverflow.com/questions/10196198/how-to-remove-convexity-defects-in-sudoku-square/10226971#10226971