Can tesseract be used just to detect whether an image has text?

91 views
Skip to first unread message

Evan Zamir

unread,
Mar 14, 2016, 3:02:58 PM3/14/16
to tesseract-ocr
I'm working on picture moderation tools, including auto-moderation of content. Since text is harder to moderate, we want to make sure all images with text get moderated by a human, but to do that we need a tool that can tell us which images have text. Doesn't need to be able to actually do OCR on the text. If tesseract does not have this capability, would appreciate suggestions where to look. Thanks!
-evan

Tom Morris

unread,
Mar 15, 2016, 12:09:58 PM3/15/16
to tesseract-ocr
On Monday, March 14, 2016 at 3:02:58 PM UTC-4, Evan Zamir wrote:
I'm working on picture moderation tools, including auto-moderation of content. Since text is harder to moderate, we want to make sure all images with text get moderated by a human, but to do that we need a tool that can tell us which images have text. Doesn't need to be able to actually do OCR on the text. If tesseract does not have this capability, would appreciate suggestions where to look. Thanks!

Not really something Tesseract is designed for. You might want to look at OpenCV http://docs.opencv.org/3.0-beta/modules/text/doc/text.html or Google "text detection in natural images" for other papers and software.

Tom
Reply all
Reply to author
Forward
0 new messages