I'm working to classify utterances as having either a positive or
negative sentiment. One of the techniques I'm exploring is using the
SentiWordNet database to assign sentiment scores to individual words.
The database contains two scores, positive and negative, for each
synset in WordNet. But, I only have the raw text and POS of the
utterance and need to determine the appropriate synset(s). Is there a
way to get frequencies for each possible synset given a lemma word, so
I can weight the scores appropriately? If not, does anyone know of a
data source that provides these frequencies?
Thanks,
Ben
Thanks for the information. I'll try using just the first one, or
maybe a weighted average of the first three or something. I suppose
this popularity data must exist somewhere. I'll try to search for it.
Ben