I think we could keep MetaMeta API as a spec, with implementations in what languages are available. Think of the MetaMeta API as the front controller that dispatches requests to whatever can do the job:
- proxy (+/- some processing) to external service
I used sphinx for some experiments in 2009, it was running as a component in RePlay [1], triggered as command from an Ant script; and RePlay was validating each "word" against wikipedia to be sure that it is a real word; the results were quite funny, see
But those words were never displayed, they were used for search, a metaphone or soundex search…
Now, if you have some context for a video, some related articles (from the links around it, etc.) you can easily align concepts from the related articles to the sound-like equivalent from sphinx.
BTW, RePlay was also dumping each keyframe as image and was doing OCR to scoop for words to be indexed.
Laurian