Comparing topic models with different preprocessing, what metrics do people use?


Kyle Jensen

May 5, 2012, 7:28:38 AM
to gen...@googlegroups.com
Hi - 

I'm building a number of models that differ in the way the corpus is preprocessed (stemming, lemmatization, etc.).  However, I do not know how best to compare the "quality" of the resulting models.  I was thinking of using held-out document perplexity.

What do others use?  A rough metric is acceptable to me.

Thanks!
Kyle

Radim Řehůřek

May 7, 2012, 5:48:09 PM
to gensim
Hi Kyle,

it's always best to evaluate the quality directly on your end task,
the one you're trying to solve with these unsupervised models.

Generic proxy metrics such as perplexity are also an option, but they
may not correlate well with what you're really trying to achieve
(i.e., a model with lower perplexity could nevertheless perform worse
in practice).
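
For a rough idea of what a held-out perplexity comparison looks like, here is a minimal sketch. It is not gensim code: an add-one smoothed unigram model stands in for the topic model, and crude_stem, preprocess, heldout_perplexity and the toy documents are all hypothetical names and data made up for illustration.

```python
import math
from collections import Counter


def crude_stem(token):
    # Hypothetical toy stemmer (illustration only): strips a few common suffixes.
    for suffix in ("ing", "ed", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def preprocess(text, stem=False):
    # Lowercase and split on whitespace, optionally applying the toy stemmer.
    tokens = text.lower().split()
    return [crude_stem(t) for t in tokens] if stem else tokens


def heldout_perplexity(train_docs, heldout_docs):
    # Add-one smoothed unigram model, used here as a stand-in for a topic model.
    # The vocabulary includes held-out tokens so that unseen words get a
    # non-zero probability and the distribution still sums to one.
    counts = Counter(t for doc in train_docs for t in doc)
    vocab = set(counts) | {t for doc in heldout_docs for t in doc}
    denom = sum(counts.values()) + len(vocab)
    log_prob = 0.0
    n_tokens = 0
    for doc in heldout_docs:
        for t in doc:
            log_prob += math.log((counts[t] + 1) / denom)
            n_tokens += 1
    # Perplexity is exp of the negative mean per-token log-likelihood.
    return math.exp(-log_prob / n_tokens)


# Toy data, assumed for illustration.
train_texts = [
    "the cat chased the mice",
    "dogs were chasing cats",
    "a mouse runs from the cat",
]
heldout_texts = ["the dog chases a cat"]

for stem in (False, True):
    train = [preprocess(t, stem=stem) for t in train_texts]
    heldout = [preprocess(t, stem=stem) for t in heldout_texts]
    label = "stemmed" if stem else "raw"
    print(label, heldout_perplexity(train, heldout))
```

Keep in mind that stemming changes the token stream and shrinks the vocabulary, so the two numbers are not measured on the same scale. That is one more reason to treat such a comparison as a rough signal only and to check against the end task.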

Best,
Radim