If you can find a suitable open source OCR library or command line
tool it shouldn't be too difficult to integrate. There is a function
extract_text() in image_processing.php that is a good place to add
this. It would be a matter of detecting appropriate bitmaps types by
the extension and forwarding it to the OCR command line tool, just
like the other blocks in that function for Word Document (etc.).
OCR projects I would look at are:
http://en.wikipedia.org/wiki/GOCR
http://en.wikipedia.org/wiki/Tesseract_(software)
http://en.wikipedia.org/wiki/Ocrad
I'm not sure about Ocropus... I limited my search to Ubuntu packages.
I hope this helps.
You are welcome to request features but it's very unlikely someone
will come along and develop it for free. But you never know! :)
If you fund development or develop this yourself it would be good to
have it in the base, if you could supply a patch.
Thanks,
Dan