There are different ways to overcome this issue with antonyms.
See, for example,
http://aclanthology.info/papers/integrating-distributional-lexical-contrast-into-word-embeddings-for-antonym-synonym-distinction
10.08.2016 17:29, jayant jain wrote:
> Another reason I'm unsure about whether word2vec is the ideal approach
> for this is that word2vec distance is not always the same as semantic
> distance. A brief explanation for this would be - word2vec trains word
> representations on the basis of context words, and often both synonyms
> and antonyms appear in similar contexts.
>
> An example from restaurant reviews would be - The food was __. Both
> negative and positive words would fit well there, and so often antonyms
> end up having similar representations (from personal experience,
> `model.most_similar('good')` almost always seems to have `bad` in the
> top 5 results.
>
> A few links which I found quite insightful -
> 1. Messing around with word2vec
> <
https://quomodocumque.wordpress.com/2016/01/15/messing-around-with-word2vec/>
> 2. Exploring antonyms with word2vec
> <
https://gist.github.com/kylemcdonald/9bedafead69145875b8c>
>
> On Wednesday, 10 August 2016 12:33:48 UTC+5:30, Lior Magen wrote:
>
> I'm trying to get a list of positive words and a list of negative
> words using Word2Vec.
>
> Iv'e create a small set of generic negative/positive words and using
> them I'm trying to do this.
>
> words_list = dict(Positive=['good', 'great', 'best'],
> Negative=['bad', 'awful', 'terribl', 'disappoint'])
>
> sentiments_dict['Positive'] = [x[0] for x in
> self.model.most_similar(positive=words_list['Positive'],
> negative=words_list['Negative'],
> topn=most_common, restrict_vocab=1000)]
>
> sentiments_dict['Positive'].extend(words_list['Positive'])
>
> sentiments_dict['Negative'] = [x[0] for x in
> self.model.most_similar(positive=words_list['Negative'],
> negative=words_list['Positive'],
> topn=most_common, restrict_vocab=1000)]
>
> sentiments_dict['Negative'].extend(words_list['Negative'])
>
>
>
> The problem is that in some cases the results are fine but in other
> they pretty suck.
>
> In suggestions on how to improve that? I believe that the basic idea
> is right but there's something I miss I guess.
>
> --
> You received this message because you are subscribed to the Google
> Groups "gensim" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to
gensim+un...@googlegroups.com
> <mailto:
gensim+un...@googlegroups.com>.
> For more options, visit
https://groups.google.com/d/optout.
--
Solve et coagula!
Andrey