Really poor performance with decimal numbers

125 views
Skip to first unread message

Alberto Andreotti

unread,
Jul 6, 2018, 4:53:08 AM7/6/18
to tesseract-ocr
Hello,

I'm having problems with the simplest image possible.
It's a screenshot from GEdit(Ubuntu's text editor), with numbers and points. This is what I get,

23.78
15
1.6
17.6
25
225
2235
0.5

Alberto

version: tesseract 4.0.0-beta.1-285-g8d3f
run from command line like this, tesseract test_image2.png  outputbase --oem 1 --psm 1
test_image2.png

Shree Devi Kumar

unread,
Jul 6, 2018, 10:38:45 AM7/6/18
to tesser...@googlegroups.com
try --psm 6

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/8d743eca-7a7c-4add-b754-c79b6ea55cba%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

Alberto Andreotti

unread,
Jul 6, 2018, 11:52:08 AM7/6/18
to tesseract-ocr
Hi,

tried it with same results, also, all other cases work well.

23.78
15
1.6
1.7
1.2
1.3
1.4
1.8
1.9

The only that won't come out well is "1.5". That's pretty crazy. Any config I may provide or something?

thanks,
Alberto.
Message has been deleted

Lorenzo Bolzani

unread,
Jul 6, 2018, 12:52:22 PM7/6/18
to tesser...@googlegroups.com

Hi,
upscale and enhance contrast, but upscale is what really matters: each letter is 20px, a dot is about three pixel, it's probably "seen" as noise.

Bye

Lorenzo

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
test_image2.png

aupa...@gmail.com

unread,
Jan 3, 2019, 12:21:53 AM1/3/19
to tesseract-ocr
Hello everybody,

did anybody get this "solved". I played a lot with upscaling, gamma changes, contrast etc. but I keep on getting errors, in particular missing decimal points even though the point seem to be very well and the image of good quality. Is any param change of help?

On Friday, July 6, 2018 at 6:52:22 PM UTC+2, Lorenzo Blz wrote:

Hi,
upscale and enhance contrast, but upscale is what really matters: each letter is 20px, a dot is about three pixel, it's probably "seen" as noise.

Bye

Lorenzo
2018-07-06 5:51 GMT+02:00 Alberto Andreotti <albertoa...@gmail.com>:
Hello,

I'm having problems with the simplest image possible.
It's a screenshot from GEdit(Ubuntu's text editor), with numbers and points. This is what I get,

23.78
15
1.6
17.6
25
225
2235
0.5

Alberto

version: tesseract 4.0.0-beta.1-285-g8d3f
run from command line like this, tesseract test_image2.png  outputbase --oem 1 --psm 1

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.

易鑫

unread,
Jan 3, 2019, 12:59:00 AM1/3/19
to tesser...@googlegroups.com
Please upload the images then we can use then to try.

<aupa...@gmail.com> 于2019年1月3日周四 下午1:21写道:
Reply all
Reply to author
Forward
0 new messages