Alcareru
unread,Jul 7, 2009, 2:01:01 AM7/7/09Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to tesseract-ocr
I'm a noob with this tesseract myself as well, but I can tell you
something I've experienced. The picture has to be big enough, but not
too big. So try different scaling factors (different scaling
algorithms are also worth trying if necessary). Too big might be just
as bad as too small. Also instead of just inverting the colors you
want to make the picture have black text on white background.
Tesseract doesn't like colors. Tesseract also don't like noise and
garbage so try to get rid of all, none black text things. Text that is
too close to other text might also give you problems, but it is also
very problematic to solve. Teaching the precise font used in the
picture to tesseract should also help. Also note that special
characters might not be interpreted correctly by tesseract if you
won't teach them to it (degree sign: º, for instance).