Hey all
Needed some advice about a problem we are trying to solve.
The problem involves finding the significant words in a para, finding the relevant words out of them, and finding the related terms including concepts that are one level higher than those words (e.g. from "ferro magnetic" to "magnetism")
We are aware about zemanta API and alchemy API, and although short term they do seem like a good idea, I'm afraid that in the long term we would lose out on the flexibility of having our own algorithms and dictionaries.
Q: would it be a better idea to build a layer on top of 3rd party API? Or to build it from scratch? (we are aware of some libraries like conceptnet, etc but haven't used them personally)
I guess the question could be: How difficult is it to reach alchemy API or zemanta API levels of confidence, especially if you are defining the field of tags narrowly? How much approximately will it take to build that tagger?
Thanks for your help!
Samudra
--
_________________________________________________
Mumbai Python Users Group - http://www.mumpy.org/
Mailing Group - http://groups.google.com/group/mumpy/
Membership Management - http://groups.google.com/group/mumpy/subscribe/