Extracting text from animated GIFs


Daniel Bishop

Feb 19, 2016, 4:33:50 AM

to tesseract-ocr

Hello everyone!

I'm just getting started with Tesseract and am wowed at how well it does on tasks like scanned black and white text! I'm... less than thrilled at how it does at my current endeavor, which is to extract the text from animated GIFs, such as from reaction GIFs and memes and so on.

After reading the FAQ and the ImproveQuality article, and after some further poking around, I don't think the image DPI is too low. Most of the problem seems to come from the variety of background colors around the text and/or the font(s) commonly used for memes.
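To make the background-color problem concrete, here is a minimal sketch of one possible preprocessing step: keep only near-white pixels and turn them into black text on a white background. It assumes the caption is drawn in near-white, which is common for meme text but certainly not universal, and that each frame has already been decoded into rows of (R, G, B) tuples by some image library (not shown). The function name `binarize_white_text` and the cutoff value are illustrative, not part of Tesseract or Leptonica.

```python
# Sketch only: isolate near-white caption text from a colorful frame.
# Assumption: the frame is already decoded into rows of (R, G, B) tuples.
# WHITE_CUTOFF is a hypothetical, untuned value chosen for illustration.

WHITE_CUTOFF = 200  # every channel must reach this to count as "white"


def binarize_white_text(frame, cutoff=WHITE_CUTOFF):
    """Return rows of 0/255 values.

    0 (black) marks pixels that were near-white in the input;
    255 (white) marks everything else.
    """
    return [
        [0 if min(r, g, b) >= cutoff else 255 for (r, g, b) in row]
        for row in frame
    ]


# Tiny 2x2 example: left column is near-white text, right column is background.
frame = [
    [(255, 255, 255), (30, 120, 200)],
    [(250, 248, 252), (200, 40, 40)],
]
print(binarize_white_text(frame))  # [[0, 255], [0, 255]]
```

A fixed cutoff like this will fail on frames with bright backgrounds or colored captions, so it is a starting point for experiments rather than a general fix.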

Does anyone have any experience with this, or have any helpful advice for this specific task? Attached is a sample of the kind of thing I want to process.

Thank you for your time.

(Incidentally, even though the new versions of Leptonica and Tesseract both say they support GIFs, I get the following error when I pass a GIF in:

Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Error in pixReadMemGif: function not present
Error in pixReadMem: gif: no pix returned
Error during processing.

)
