Need suggestions for reading numbers from screenshots

Béci

Jun 29, 2016, 4:14:47 PM

to tesseract-ocr

I'm using tesseract-ocr through Emgu CV, with the following options:

"tessedit_char_whitelist", "0123456789."

"load_system_dawg", "false"

"load_freq_dawg", "false"
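For anyone who wants to try the same settings outside Emgu CV, here is a rough sketch of how they could be passed to the `tesseract` command-line tool through its `-c name=value` flag. This is an assumption about an alternative setup, not what I actually run: the helper `build_tesseract_args` and the file name `digits.png` are made up for illustration.

```python
# Options from the post, as name -> value pairs.
OPTIONS = {
    "tessedit_char_whitelist": "0123456789.",
    "load_system_dawg": "false",
    "load_freq_dawg": "false",
}


def build_tesseract_args(image_path, options):
    """Build an argument list for the tesseract CLI (hypothetical helper).

    Each option becomes a separate `-c name=value` pair. The result is
    written to stdout instead of a file.
    """
    args = ["tesseract", image_path, "stdout"]
    for name, value in options.items():
        args += ["-c", f"{name}={value}"]
    return args


# "digits.png" is a placeholder file name.
command = build_tesseract_args("digits.png", OPTIONS)
```

The resulting list can be handed to `subprocess.run(command, capture_output=True, text=True)` if the `tesseract` binary is installed.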

When I use Tesseract alone (without the cube engine), it sometimes gives me wrong results.

An example of how I'm currently trying to handle and interpret the image:

I take the original image then:

1. add a border to it

2. enlarge it 3x using Lanczos4 (in my experience 3x works best, probably because each pixel becomes a full 3x3 block of pixels)

3. binarize it at the midpoint between the darkest and lightest values, and invert it if the foreground color is white.
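To make the three steps concrete, here is a minimal pure-Python sketch on a grayscale image stored as a list of rows. It is not my actual Emgu CV code and it rests on several assumptions: nearest-neighbour scaling stands in for Lanczos4 (a different filter, used here only to avoid dependencies), the border is filled with the top-left pixel value on the guess that it is background, and "foreground is white" is detected by assuming text covers fewer pixels than background. All function names are made up.

```python
def add_border(img, width, value):
    """Step 1: pad the image on all sides with `value`."""
    new_w = len(img[0]) + 2 * width
    out = [[value] * new_w for _ in range(width)]
    for row in img:
        out.append([value] * width + list(row) + [value] * width)
    out.extend([value] * new_w for _ in range(width))
    return out


def upscale_nearest(img, factor):
    """Step 2 stand-in: nearest-neighbour enlargement (NOT Lanczos4).

    Every source pixel becomes a factor x factor block.
    """
    out = []
    for row in img:
        wide = [p for p in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def binarize_midpoint(img):
    """Step 3: threshold halfway between the darkest and lightest value.

    Assumption: text is the minority of pixels. If fewer than half the
    pixels end up white, the text was light on dark, so invert to get
    dark text on a white background.
    """
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    threshold = (lo + hi) / 2
    binary = [[255 if p > threshold else 0 for p in row] for row in img]
    total = sum(len(row) for row in binary)
    white = sum(1 for row in binary for p in row if p == 255)
    if white * 2 < total:
        binary = [[255 - p for p in row] for row in binary]
    return binary


def preprocess(img, border=1, factor=3):
    """Run border -> enlarge -> binarize in the order described above."""
    bordered = add_border(img, border, img[0][0])  # assumes corner is background
    enlarged = upscale_nearest(bordered, factor)
    return binarize_midpoint(enlarged)
```

With a real image library, step 2 would use a proper Lanczos resampler and the inversion test would ideally look at known text regions rather than a pixel-count heuristic.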

It reads 1080 instead of 480:

Original: [image] -> [image] -> [image]

Or it reads 1000 instead of 400:

Original: [image] -> [image] -> [image]

Are there any Tesseract options that might improve this? Or have I done something wrong with the original image?

I also tested the cube engine. I'm not looking for suggestions related to it, but here are a few words about my experience.

If I use only the cube engine, it gives me garbage most of the time.

If I use both engines combined, I get better results, but sometimes very unexpected ones (I know that parameters like tessedit_char_whitelist have no effect in cube-engine mode).

For example, it reads "M)" instead of 40:

Original: [image] -> [image] -> [image]

I also tried the following options, but I got the same results as above:

"chop_enable", "true"

"enable_new_segsearch", "0"

"language_model_ngram_on", "0"

Any suggestions are welcome.
