Evidence Lower Bound as model selection criterion

532 views
Skip to first unread message

Marcel Falkiewicz

unread,
Aug 5, 2015, 3:45:24 AM8/5/15
to Stan users mailing list
Hello, 
 in the field of neuroscience there is a 'school', which claims that evidence lower bound can be considered as the ultimate criterion for model comparison and selection. For example, see http://www.sciencedirect.com/science/article/pii/S1053811911008160# . However, I've read many papers from different fields and haven't encountered such large enthusiasm towards it. Since my statistical knowledge is pretty basic, I would like to ask your opinion - can ELBO be considered as a model selection criterion? If so, what are the neccessary conditions?

Regards,
 Marcel

Michael Betancourt

unread,
Aug 5, 2015, 10:20:46 AM8/5/15
to stan-...@googlegroups.com
The ELBO is an approximation to the evidence (sometimes called
the marginal likelihood).  It tends to be a better approximation then
BIC, but even if it is very accurate you have to deal with the subtleties
of the evidence itself.  For that see the numerous relevant posts on
this list as well as the discussion in Bayesian Data Analysis.

--
You received this message because you are subscribed to the Google Groups "Stan users mailing list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to stan-users+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Dustin Tran

unread,
Aug 5, 2015, 10:45:47 AM8/5/15
to stan-...@googlegroups.com
Aside from subtleties with using the marginal likelihood itself, the ELBO can indeed be used. The bigger issue to keep in mind though is the sensitivity of the variational approximation to the posterior. That is, using the ELBO as a replacement for the marginal likelihood makes the assumption that the KL divergence between the posterior and the variational distribution are the same for every model. This is rarely true, and it is easy to construct scenarios where the marginal likelihood is actually lower for model 1 but the ELBO is higher for model 2, because the variational family cannot fit model 1 very well. This is not nearly as bad as AIC or BIC in "most cases”, however, so I’m not surprised by that paper’s stance.

Dustin

Dustin Tran

unread,
Aug 5, 2015, 10:53:09 AM8/5/15
to stan-...@googlegroups.com
> it is easy to construct scenarios where the marginal likelihood is actually _higher_ for model 1 but the ELBO is higher for model 2,
fixed typo

Dustin

Marcel Falkiewicz

unread,
Aug 5, 2015, 4:02:47 PM8/5/15
to stan-...@googlegroups.com
Thank you guys for clarification, despite the sad conclusion:) Anyway,
it's good to know that ELBO could be useful in model selection,
considering all the limitations. Maybe this should be mentioned in
documentation?

Andrew Gelman

unread,
Aug 5, 2015, 6:10:04 PM8/5/15
to stan-...@googlegroups.com
Yes, to start with I think the term “evidence” is horrible; second, the whole “lower bound” thing is pretty meaningless in that if you are comparing 2 models, you’ll use the difference so a “lower bound” is no better than an “estimate”; third, it typically depends on aspects of the prior that are assigned arbitrarily, for example if you change your prior from normal(0,10) to normal(0,100), your marginal likelihood changes by a factor of 10.  So, yeah, basically don’t do it.  If you want to do model comparison I recommend predictive criteria such as disucssed here:
A


On Aug 5, 2015, at 10:20 AM, Michael Betancourt <betan...@gmail.com> wrote:

Andrew Gelman

unread,
Aug 5, 2015, 6:21:15 PM8/5/15
to stan-...@googlegroups.com
Ummm, no!
Reply all
Reply to author
Forward
0 new messages