ocr thermostat numbers

105 views
Skip to first unread message

James Okken

unread,
May 13, 2015, 1:50:06 PM5/13/15
to tesser...@googlegroups.com
hi everyone.

can tesseract pull the numbers off this thermostat picture attached? I've tried a number of things including making the photo a better quality, to no avail.

any help would be appreciated! thanks!!
resultw.jpg
result.jpg

Dmitri Silaev

unread,
May 14, 2015, 3:41:14 AM5/14/15
to tesser...@googlegroups.com
Hi James,

I can suggest a number of steps regarding connected component analysis but it's better you'd show the original photo images. Probably there are easier ways to get the numbers from them. Be aware also that Tesseract might not be the best way to read LCD segment displays. It can work well for you, though; it depends on source image specifics. Attach several samples.

Best regards,
Dmitri Silaev
www.CustomOCR.com





--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0de8f0b4-dff2-44f0-bd91-bd0403e4d130%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Artur Augusto

unread,
May 14, 2015, 1:07:25 PM5/14/15
to tesser...@googlegroups.com
If you are trying to recognize it in "real time", you can try: https://github.com/arturaugusto/display_ocr

I have sucess using it to read digital multimeters.

Artur

--
Message has been deleted

James Okken

unread,
May 14, 2015, 1:22:49 PM5/14/15
to tesser...@googlegroups.com
Dmitri,

thanks very much for your response. any help would be huge!
anything you suggest for LCD segments would be huge too!

I've attached more of the original images.

thanks
try.jpg
temp__05m-14d__ 8h-00m.jpg
temp__05m-11d__22h-00m.jpg

Dmitri Silaev

unread,
May 18, 2015, 8:03:00 AM5/18/15
to tesser...@googlegroups.com
Hi James,

Here I think more effort needs to be taken for getting better source images. In principle, there are two alternatives:
- Get a good quality source image. Then you'll be able to handle it by means of relatively simple preprocessing. Maybe using ImageMagick. Probably you'll be able to use Tesseract.
- Let any arbitrary image to get to your pipeline. Prepare to develop (or order from a 3rd party) complex image processor, full-fledged programming, etc.

If you choose to go with the first, I suggest the following to be improved to simplify further OCR:
- Don't use JPEG. Because of that, there's massive bunch of compression artifacts in each of you images. Use lossless PNG instead.
- Improve lighting. Too dark shots result in overwhelming noise. Either external or use flash. Beware of flares, though. Experiment in order to get best shots.
- Try to hold camera evenly when shooting (fronto-parallel projection). Otherwise you'd need perspective correction as a preprocessing step. Or at least skew correction.
- LCD display area to occupy as much as possible area of the image, centered. Otherwise you'd need background removal, ROI detection or devise heuristics for locating reference points in the image.

If you fix all of the above, you'll probably be able to manage with the homemade ImageMagick scripts and Tesseract. You can send your sample images again, so that we can discuss what can be done further.

There's a number of training attempts for LCD display fonts on the internet - look for them. They seem to address fonts similar to yours, but in the end you'd probably need to train yourself.

Best regards,
Dmitri Silaev
www.CustomOCR.com





On Thu, May 14, 2015 at 8:17 PM, James Okken <jokk...@gmail.com> wrote:
Dmitri,

thanks very much for your response. any help would be huge!
anything you suggest for LCD segments would be huge too!

I've attached more of the original images.

thanks

James Okken

unread,
May 19, 2015, 4:46:38 PM5/19/15
to tesser...@googlegroups.com
Dmitri,

you're right I need to get better quality pictures first. i will work on it and hopefully come back with better ones

thanks!
Reply all
Reply to author
Forward
0 new messages