Difference between log entropy model and tfidf

nilz

Aug 17, 2011, 7:05:45 PM
to gensim
Hello gensim users,

I was wondering if someone could point me at some literature in IR
research that discusses the difference between log entropy and tfidf?
Some works I have seen on an initial search use the two synonymously
while there are others that tend to make some distinction.

For my data set, I get much better results with the log entropy model
as opposed to tfidf, and I want to understand and document the
differences between the two.

Thanks
Nilz

Stephan Gabler

Aug 18, 2011, 3:03:51 AM
to gen...@googlegroups.com

Hey Nilz,

I implemented the log entropy model after reading this article:

An empirical evaluation of models of text document similarity
MD Lee, B Pincombe… - … of the 27th Annual Conference of the …, 2005

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.111.7144&rep=rep1&type=pdf
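
For a rough feel of how the two weightings differ, here is a minimal pure-Python sketch. It is not the gensim implementation: the function names are made up for illustration, tfidf is assumed to be raw term frequency times log2(N / df), and log entropy is assumed to be the textbook form log(tf + 1) * (1 + sum_j p_ij * log(p_ij) / log(N)), where p_ij is the share of term i's total count that falls in document j. Libraries may differ in details such as the log base, normalization, or the exact denominator.

```python
import math
from collections import Counter


def tfidf_weights(docs):
    """Illustrative tfidf: raw term frequency times log2(N / document frequency)."""
    n_docs = len(docs)
    counts = [Counter(doc) for doc in docs]
    doc_freq = Counter()
    for c in counts:
        doc_freq.update(c.keys())  # each document counts once per term
    return [
        {t: f * math.log2(n_docs / doc_freq[t]) for t, f in c.items()}
        for c in counts
    ]


def log_entropy_weights(docs):
    """Illustrative log entropy: log(tf + 1) times an entropy-based global weight.

    Assumes more than one document, so that log(N) is non-zero.
    """
    n_docs = len(docs)
    counts = [Counter(doc) for doc in docs]
    total = Counter()
    for c in counts:
        total.update(c)  # total occurrences of each term in the corpus
    # Sum of p * log(p) per term, where p is the share of the term's
    # occurrences that fall in a given document (always <= 0).
    plogp = Counter()
    for c in counts:
        for t, f in c.items():
            p = f / total[t]
            plogp[t] += p * math.log(p)
    global_w = {t: 1.0 + plogp[t] / math.log(n_docs) for t in total}
    return [
        {t: math.log(f + 1) * global_w[t] for t, f in c.items()}
        for c in counts
    ]


if __name__ == "__main__":
    docs = [["a", "b", "b", "b", "b"], ["a", "b"], ["a", "c"]]
    for name, fn in (("tfidf", tfidf_weights), ("log entropy", log_entropy_weights)):
        print(name)
        for weights in fn(docs):
            print("  ", {t: round(w, 3) for t, w in sorted(weights.items())})
```

Under these assumptions both schemes give zero weight to a term spread evenly over every document. The difference is that tfidf grows linearly with the in-document count and only looks at how many documents contain the term, while log entropy dampens the count with a logarithm and looks at how unevenly the term's occurrences are distributed.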


hope this helps,

stephan

Dan Howarth

Sep 20, 2015, 5:54:12 PM
to gensim, stephan...@googlemail.com
Could you explain how this can be used to create an LSA model with the gensim library?