Posterior Statistic in Tracer vs Path and Stepping Stone Sampling Statistic

158 views
Skip to first unread message

dpe...@asu.edu

unread,
Feb 21, 2018, 9:29:27 PM2/21/18
to beast-users
Hi all, 

In tracer the statistic at the top is labeled "posterior".  What does this represent?  I thought it was the log likelihood of the model but it seems like that is determined using path and stepping stone sampling.  I'm pretty sure I'm missing something obvious since likelihood + prior = posterior but I'm new to Bayesian analysis.  An answer that contrasts it with ps/ss would be awesome.

Thanks much, Dan

Alexei Drummond

unread,
Feb 21, 2018, 9:48:13 PM2/21/18
to beast...@googlegroups.com
Hi Dan,

The “posterior” statistic it is the log of the posterior probability for each state in the chain, P(X_i | D), for parameters X in step i given data D.

The “likelihood” statistic is the likelihood of the data for each state in the chain, P(D | X_i). So if you average this statistic over the MCMC chain you get the average probability of the data, over the parameter space *weighted by the posterior* (since the MCMC chain is a random sample from the posterior).

In contrast the marginal likelihood, P(D), is the average probability of the data over the parameter space *weighted by the prior*. So you need to arrive at it using a different technique, like path sampling.

Cheers
Alexei

--
You received this message because you are subscribed to the Google Groups "beast-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to beast-users...@googlegroups.com.
To post to this group, send email to beast...@googlegroups.com.
Visit this group at https://groups.google.com/group/beast-users.
For more options, visit https://groups.google.com/d/optout.

Andrew Rambaut

unread,
Feb 22, 2018, 3:30:46 AM2/22/18
to beast...@googlegroups.com
The value labelled ‘posterior’ in the output of BEAST is the log product of all the prior densities and the data likelihoods (i.e., the sum of the prior and the likelihood columns in the trace file). Technically it is the posterior probability of the model given the data * the probability of the data (i.e., the normalizing constant - the one that is intractable to calculate and is the reason we do MCMC in the first place). It is has been erroneously labelled ‘posterior’ since the beginning of BEAST. Basically it is logged in a normal MCMC run simply to provide diagnostic information about convergence and mixing.  

Andrew

dpe...@asu.edu

unread,
Feb 22, 2018, 5:27:57 PM2/22/18
to beast-users
Thanks very much.  It makes sense that the P(data) must be embedded in one of the 3 tracer terms.

So posterior (via tracer) = P(model|data) * P(data)

Could I then separate them using the result of the path sampling?

That posterior (via tracer) / path_sampling_log_marginal_likelihood = P(data)  ?

Thanks again.
Reply all
Reply to author
Forward
0 new messages