Soft Cosine Similarity features

54 views
Skip to first unread message

Tedo Vrbanec

unread,
May 17, 2021, 9:59:30 AM5/17/21
to Gensim
What are the features in SCS?
I can see that similarity index is brought by
termsim_index = WordEmbeddingSimilarityIndex(model.wv),
but what is WordEmbeddingSimilarityIndex doing? :)
Thanks...

Radim Řehůřek

unread,
May 18, 2021, 2:45:07 AM5/18/21
to Gensim
Hi Tedo,

we have a tutorial for SCM:

HTH,
Radim

Tedo Vrbanec

unread,
May 18, 2021, 2:40:25 PM5/18/21
to Gensim
Thanks, Radim, but the explanation is sparce. ;)

Now, as mentioned earlier, we will be using some downloaded pre-trained embeddings. We load these into a Gensim Word2Vec model class and we build a term similarity mextrix using the embeddings.

import gensim.downloader as api model = api.load('word2vec-google-news-300') 

from gensim.similarities import SparseTermSimilarityMatrix, WordEmbeddingSimilarityIndex

termsim_index = WordEmbeddingSimilarityIndex(model) termsim_matrix = SparseTermSimilarityMatrix(termsim_index, dictionary, tfidf)


Reply all
Reply to author
Forward
0 new messages