I've used tesseract to OCR frames from 640x480 screencast videos, and
it generally worked fine:
http://ianozsvald.com/2010/05/17/extracting-keyword-text-from-screencasts-with-ocr/
What problems are you seeing when you try tesseract?
Ian.
> --
> You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
> To post to this group, send email to tesser...@googlegroups.com.
> To unsubscribe from this group, send email to tesseract-oc...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
--
Ian Ozsvald (A.I. researcher, screencaster)
i...@IanOzsvald.com
http://IanOzsvald.com
http://MorConsulting.com/
http://blog.AICookbook.com/
http://TheScreencastingHandbook.com
http://FivePoundApp.com/
http://twitter.com/IanOzsvald
For my videos I took 640x480 FLV screencasts (from ShowMeDo.com -
pretty high quality videos with hardly any artefacts) and I ran
tesseract 2 on the colour screengrabs without rescaling.
What resolution are you capturing at?
If the fonts are small you might want to try sharpening the image
manually, in case anti-aliasing/smoothing is blending adjacent
characters into one another. You could confirm visually whether this
looks to be the case.
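To make "sharpen" concrete, here is a minimal sketch in plain Python. It assumes the frame has already been loaded as a grayscale grid (a list of rows of 0-255 values), and it uses a common 3x3 sharpening kernel; the kernel choice and the function name are my own illustration, not anything tesseract requires. In practice an image tool would do the same job.

```python
def sharpen(pixels):
    """Apply a 3x3 sharpening kernel to a grayscale image.

    pixels: list of rows, each a list of ints in 0-255 (assumed layout).
    Returns a new image of the same size; border pixels are copied unchanged.
    """
    # Common sharpening kernel: boosts the centre, subtracts the 4 neighbours.
    kernel = [[0, -1, 0],
              [-1, 5, -1],
              [0, -1, 0]]
    height = len(pixels)
    width = len(pixels[0])
    out = [row[:] for row in pixels]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            total = 0
            for ky in range(3):
                for kx in range(3):
                    total += kernel[ky][kx] * pixels[y + ky - 1][x + kx - 1]
            # Clamp back into the valid 0-255 range.
            out[y][x] = max(0, min(255, total))
    return out
```

Flat regions come out unchanged, while edges between a character and its background get more contrast, which is the effect you would be checking for by eye.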
Maybe you could upload a sample screengrab and explain what it gets
right and where it goes wrong (maybe by drawing on the image)?
i.
I have a couple of questions:
1. How can I calculate the ideal image size (300dpi?) to feed to
tesseract? That is, how do I work out how much scaling the image
needs before running OCR?
2. I'm currently using ImageMagick's convert program for scaling and
converting to grayscale. Would it make a difference if I used
leptonica instead?
3. Does the number of bits of color matter? Is there an optimal color depth?
4. Does the OCR work best when ClearType is enabled or disabled?
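To make questions 1 and 2 concrete, this is roughly the kind of calculation and preprocessing step I have in mind, as a sketch only. It assumes the capture DPI is known (the 96 default below is a guess at a typical screen setting, not a measured value), takes 300dpi as the target from question 1, and builds an ImageMagick convert command using its -colorspace and -resize options. The function names and file names are hypothetical.

```python
def scale_percent(source_dpi, target_dpi=300):
    """Percentage to resize by so the image reaches target_dpi."""
    return round(100 * target_dpi / source_dpi)


def convert_command(src, dst, source_dpi=96, target_dpi=300):
    """Build an ImageMagick 'convert' argument list (grayscale + upscale).

    source_dpi=96 is an assumed screen resolution; adjust to the real capture.
    """
    percent = scale_percent(source_dpi, target_dpi)
    return ["convert", src,
            "-colorspace", "Gray",      # convert to grayscale
            "-resize", f"{percent}%",   # scale up towards target_dpi
            dst]


if __name__ == "__main__":
    # Hypothetical file names; prints the command rather than running it.
    print(" ".join(convert_command("frame.png", "frame_ocr.png")))
```

Whether 300dpi is actually the right target, and whether this scaling helps, is exactly what I am asking.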