Tesseract 3.02.2 having trouble with numeric values that contain decimal points


Morlock

Aug 21, 2013, 12:44:45 PM
to tesser...@googlegroups.com
Hello,
 
I'm using Tesseract v3.02.02.
 
I'm unable to get it to consistently recognize a number that contains a decimal point.  Tesseract recognizes the digits, and it recognizes the leading minus sign when one is present.
But it always drops the decimal point.  Has anyone else run into this and found a solution? I'm doing all of this from a C++ executable on Ubuntu.
 
Here are the things I tried that did not help:
 
1. I normally use png files.  I've tried jpg and tiff but the results are the same.
2. I tried playing with contrast.  I made the background black and the color of the numbers bright yellow
3. I tried setting the tessedit_char_whitelist to ".-0123456789" explicitly
4. I've also used TesseractRect to OCR only the numbers I'm interested in.
5. I tried using GetUNLVText instead of GetUTF8Text.  GetUNLVText was worse and just returned garbage.
6. The command-line tesseract command also gives me the same results; I even tried using the 'digits' configuration.
 
regards, m

Quan Nguyen

Aug 22, 2013, 8:04:10 PM
to tesser...@googlegroups.com
Any example image?

Navanath Divate

Jun 13, 2014, 6:39:16 AM
to tesser...@googlegroups.com
Hey, have you got a solution for detecting decimal points?
I need that solution.

Perry Horwich

Jun 13, 2014, 11:23:29 AM
to tesser...@googlegroups.com
Not sure if this is a good idea, but it would be easy and quick to try.
What is the matrix size (pixel dimensions) of your image?  Have you tried increasing the image size and then re-scanning?
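To make the "increase the image size" idea concrete, here is a minimal sketch, assuming the image is already decoded into an 8-bit grayscale, row-major buffer. The `GrayImage` struct and `upscale_nearest` function are hypothetical names made up for illustration; they are not part of Tesseract or Leptonica. The guess behind it is that a decimal point only a pixel or two across may be discarded as noise, so enlarging the image before OCR could help. That is not a confirmed fix.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical container for an 8-bit grayscale image stored row by row.
struct GrayImage {
    std::size_t width;
    std::size_t height;
    std::vector<std::uint8_t> pixels;  // expected size: width * height
};

// Enlarge an image by an integer factor with nearest-neighbor sampling:
// each source pixel becomes a factor-by-factor block in the output.
GrayImage upscale_nearest(const GrayImage& src, std::size_t factor) {
    if (factor == 0) {
        throw std::invalid_argument("factor must be at least 1");
    }
    if (src.pixels.size() != src.width * src.height) {
        throw std::invalid_argument("pixel buffer does not match dimensions");
    }

    GrayImage dst{src.width * factor, src.height * factor, {}};
    dst.pixels.resize(dst.width * dst.height);

    for (std::size_t y = 0; y < dst.height; ++y) {
        for (std::size_t x = 0; x < dst.width; ++x) {
            // Map the output coordinate back to its source pixel.
            dst.pixels[y * dst.width + x] =
                src.pixels[(y / factor) * src.width + (x / factor)];
        }
    }
    return dst;
}
```

Calling `upscale_nearest(img, 2)` doubles both dimensions. In practice an image library's interpolating scaler (for example Leptonica's `pixScale`) would likely give smoother glyph edges than this blocky version; the sketch only shows the step itself.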

Navanath Divate

Jun 16, 2014, 12:32:59 AM
to tesser...@googlegroups.com
Hi Perry,

I'm attaching the image from which I'm trying to extract the text. My problem is that the result sometimes omits the decimal point, so please help me solve this.
2.jpg

Paul

Jun 16, 2014, 8:34:05 PM
to tesser...@googlegroups.com
The image you provided is broken.