Comparing topic models with different preprocessing, what metrics do people use?

58 views

Skip to first unread message

Kyle Jensen

unread,

May 5, 2012, 7:28:38 AM5/5/12

to gen...@googlegroups.com

Hi -

I'm building a number of models that differ in the way the corpus is preprocessed (stemming, lemmatization, etc.). However, I do not know how best to compare the "quality" of the resulting models. I was thinking of using held-out document perplexity.

What do others use? A rough metric is acceptable to me.

Thanks!

Kyle

Radim Řehůřek

unread,

May 7, 2012, 5:48:09 PM5/7/12

to gensim

Hi Kyle,

it's always best use evaluate the quality directly on your end task,
the one you're trying to solve with these unsupervised models.

Substitute generic metrics like perplexity are also an option, but may
not necessarily correlate well with what you're really trying to
achieve (i.e., a model with lower perplexity could nevertheless
perform worse in practice).

Best,
Radim

Reply all

Reply to author

Forward

0 new messages