Andrew Gelman
unread,Jun 9, 2016, 12:49:03 PM6/9/16Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Sign in to report message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to stan development mailing list, Rayleigh Lei, Aki Vehtari, Dustin Tran, Alp Kucukelbir, Daniel Lee, Bob Carpenter, Ben Goodrich
Hi--as many of you know, Rayleigh wrote a script based on some code of Ben to run NUTS and then mean-field ADVI on a library of over 200 Stan programs (the Bugs examples, the Arm examples, and a bunch more:
https://github.com/stan-dev/example-models/wiki). The idea, for each model, was to first run NUTS to convergence, and then run ADVI to check that the estimates from ADVI were close. To give ADVI the best chance of succeeding, we used the posterior mean estimates from NUTS as our starting points for ADVI.
Next it's time to summarize the evaluations. For each parameter in a model, we define "close" in terms of the z-score: ((ADVI estimate) - (posterior mean))/(posterior sd), where the posterior mean and sd are from NUTS. The idea is that this z-score should be close to zero. For each model, we computed the maximum of the z-scores for all the parameters in the model (which includes transformed parameters and generated quantities). We then had 200-plus max z-scores, which we plotted; see attached.