Two different approaches using the same model (Word2Vec from Gensim) give different values for cosine similarity.
For example:
$ head -c 50 Gensim_CS.csv
1.0 0.8943 0.16969 0.15607 0.38753 0.46953 0.32108
$ head -c 51 Sklearn_CS.csv
1.0 0.95788 0.64737 0.63894 0.73894 0.77508 0.71154
I should explain how I convert the distances returned by sklearn into similarities:
1. The results are divided by the maximum of their absolute values.
2. Similarity = 1/(1+distance)
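To make the two steps concrete, here is a minimal sketch in plain Python. The function name `distances_to_similarities` and the sample distances are hypothetical, chosen only for illustration; they are not taken from the CSV files above.

```python
def distances_to_similarities(distances):
    """Apply the two steps above to a flat list of distances."""
    # Step 1: scale by the largest absolute value (assumed to be non-zero).
    max_abs = max(abs(d) for d in distances)
    scaled = [d / max_abs for d in distances]
    # Step 2: similarity = 1 / (1 + distance).
    return [1.0 / (1.0 + d) for d in scaled]


# Hypothetical distances, for illustration only.
example = [0.0, 0.5, 1.0, 2.0]
print(distances_to_similarities(example))
```

Note that for non-negative distances the scaled values lie in [0, 1], so this mapping cannot produce a similarity below 0.5.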
I even tried my favorite (angular) transformation:
similarity = 2*arccos(distance)/Pi
That didn't work either.
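A minimal sketch of that transformation, exactly as written above, next to the direct relation for comparison. Both function names are hypothetical. The clamping step is my own assumption, added because arccos is only defined on [-1, 1]. The second function applies only if the sklearn values are cosine distances in the sense of `sklearn.metrics.pairwise.cosine_distances`, which is defined as 1 minus the cosine similarity.

```python
import math


def angular_transform(distance):
    """similarity = 2 * arccos(distance) / pi, as written above."""
    # Assumption: clamp to [-1, 1] so that math.acos does not raise ValueError.
    clamped = max(-1.0, min(1.0, distance))
    return 2.0 * math.acos(clamped) / math.pi


def cosine_distance_to_similarity(distance):
    """Direct relation, valid only if distance = 1 - cosine similarity."""
    return 1.0 - distance


# Hypothetical values, for illustration only.
for d in (0.0, 0.25, 1.0):
    print(d, angular_transform(d), cosine_distance_to_similarity(d))
```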
So, does model.wv.n_similarity actually compute cosine similarity, or not?