New issue 589 by derick...@gmail.com: tesseract 3.0.1 returns empty result
for some very clear, large digit images
http://code.google.com/p/tesseract-ocr/issues/detail?id=589
What steps will reproduce the problem?
1. Run the tesseract 3.0.1 executable as follows:
tesseract.exe tmp.bmp output makebox nobatch digits
2. Check the output.box file for the result.
What is the expected output? What do you see instead?
1. The result should be "698".
2. Tesseract actually returns an empty result for this image and for some
similar images that are very clear, even though the same build can recognize
some very poor images.
What version of the product are you using? On what operating system?
3.0.1
Please provide any additional information below.
Attachments:
tmp.bmp 32.1 KB
Tesseract is also very sensitive to small noise, even when the noise blob
is far away from the clear digits. For example, if a small black blob is
added to the bottom-left corner of the attached tmp.bmp, far away from the
number block, tesseract may return a result instead of the empty result
described above. But a slightly different noise blob changes the result:
sometimes the result is "698", while after changing the shape of the noise
blob a little the result may turn into "598", even though the noise blob
does not overlap the digits at all.
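For anyone who wants to repeat the noise experiment, here is a rough Python sketch, not part of the original report. It assumes Pillow is installed, that `tesseract` is on the PATH, and that the image has a light background. The helper names (`blob_pixels`, `tesseract_command`, `run_with_blob`), the blob size, the margin, and the file name `tmp_noise.bmp` are all made up for illustration. Only the command-line arguments are taken from the steps above.

```python
import subprocess


def blob_pixels(width, height, size=3, margin=2):
    """Return (x, y) coordinates of a size x size square near the
    bottom-left corner. Image coordinates start at the top-left,
    so the bottom row is y = height - 1."""
    if size + margin > min(width, height):
        raise ValueError("blob does not fit in the image")
    return [(margin + dx, height - 1 - margin - dy)
            for dy in range(size) for dx in range(size)]


def tesseract_command(image_path, output_base="output"):
    """Build the same command line as in the reproduction steps."""
    return ["tesseract", image_path, output_base,
            "makebox", "nobatch", "digits"]


def run_with_blob(src="tmp.bmp", dst="tmp_noise.bmp", size=3):
    """Stamp a black square onto a copy of src, run tesseract on it,
    and return the contents of the resulting .box file."""
    from PIL import Image  # assumption: Pillow is available

    img = Image.open(src).convert("L")  # 8-bit grayscale
    for xy in blob_pixels(img.width, img.height, size):
        img.putpixel(xy, 0)  # 0 = black
    img.save(dst)

    subprocess.run(tesseract_command(dst), check=True)
    with open("output.box") as fh:
        return fh.read()
```

Calling `run_with_blob(size=3)` and then `run_with_blob(size=4)` would show whether a small change to a blob that never touches the digits changes the contents of output.box.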