What kind of image which tesseract can read?

55 views
Skip to first unread message

easyma...@gmail.com

unread,
Jan 26, 2018, 4:05:54 AM1/26/18
to tesseract-ocr
I have curiosity about the limitation of tesseract in reading text in image. I have tested tesseract for image like pdf screenshot and I satisfied with the result.

But, when I tried tesseract for text in natural image or image that the text is manually added by user in random background (ex: googling interesting image and adding text in photoshop), tesseract has such difficulties to read it, of course because the variety of background. Therefore, I did some preprocessing like finding text edges, grayscaling images etc to make the image more uniform.

Then, I got interesting result.
I did 2 preprocessing, grayscaling and doing histogram equalization, and save the image with 2 different names then run tesseract to read it.
For the 1st image (grayscaling), tesseract can read 3 of 4 words correctly, and for the 2nd image, tesseract can read nothing.

And then, I stack the images side to side and save it as one image, then run tesseract to read it.
The result is, for the first image, tesseract can read correctly and for the 2nd image, tesseract can read 3/4 words.

So, what kind of image which tesseract can read?

note: Im using my own tesseract model in tesseract 4.0 with word_dawg= 0.5
photo_2018-01-26_15-30-40.jpg
Reply all
Reply to author
Forward
0 new messages