You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Gensim
I am trying to figure out how to use TF-IDF to compare multi-word vectors. I just ran the Gensim TF-IDF on a Wikipedia corpus. After, I noticed when common words like 'am', 'like' and 'good' indexed for in the model they did not return a tf-idf vector with a low score but rather returned an empty vector which throws off my algorithm. Is there any way parameter to ensure these words have a smaller score?
Cheers,
Sam
Radim Řehůřek
unread,
Jun 15, 2021, 8:33:47 AM6/15/21
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Gensim
Hi Sam,
in TFIDF, words don't really have vectors – the score for a specific term depends on a specific document.
So not sure what you mean. What gives you an "empty vector", exactly?
Best,
Radim
Masto Music
unread,
Jun 17, 2021, 5:55:36 AM6/17/21
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Gensim
I understand in TFIDF words don't really have vectors, but I am trying to use TF-IDF scores to multiply by fasttext word vectors to compute a meaning for an overall phrase.
So the fasttext word vectors for each word are scaled by the importance of the word (which comes from TFIDF score).
Masto Music
unread,
Jun 17, 2021, 5:56:41 AM6/17/21
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message