Suppose one is using an indicator variable approach to judge the relative strength of several covariates on some part of a hierarchical Bayesian model (such as the process model component of an occupancy model). Then one can calculate the proportion of times each variable was included in an MCMC iteration and use this as a judgement of variable importance. My question is how to get estimates for the effect parameters being selected? The first thing that come to mind is a choice between 1) using the full posterior for parameter estimate, or 2) selecting only the iterations that included that variable to compute the posterior estimates. The problem with the first is if the parameter has low frequency of selection, then the posterior is heavily influenced by the prior. For the second, this seems like it runs the risk of inflating the strength of the effect (Lukacs, P., K. Burnham, and D. Anderson. 2010. Model selection bias and Freedman’s paradox. Annals of the Institute of Statistical Mathematics 62:117-125). I am considering using a different approach. What if I use a slab and spike prior for the effects with the spike being an improper prior with infinite density centered at zero. The slab part is the normal uninformative prior for the parameter. I think this will accomplish something akin to what is achieved in model averaging using an information criterion. This is different than what is usually done with slab and spike priors where the spike is chosen to match the parameter estimated without variable selection, purely to improve mixing. I would like to know what others think about this idea, and if anyone has a citation that used this method?
Suppose one is using an indicator variable approach to judge the relative strength of several covariates on some part of a hierarchical Bayesian model (such as the process model component of an occupancy model). Then one can calculate the proportion of times each variable was included in an MCMC iteration and use this as a judgement of variable importance. My question is how to get estimates for the effect parameters being selected? The first thing that come to mind is a choice between 1) using the full posterior for parameter estimate, or 2) selecting only the iterations that included that variable to compute the posterior estimates. The problem with the first is if the parameter has low frequency of selection, then the posterior is heavily influenced by the prior. For the second, this seems like it runs the risk of inflating the strength of the effect (Lukacs, P., K. Burnham, and D. Anderson. 2010. Model selection bias and Freedman’s paradox. Annals of the Institute of Statistical Mathematics 62:117-125). I am considering using a different approach. What if I use a slab and spike prior for the effects with the spike being an improper prior with infinite density centered at zero. The slab part is the normal uninformative prior for the parameter. I think this will accomplish something akin to what is achieved in model averaging using an information criterion. This is different than what is usually done with slab and spike priors where the spike is chosen to match the parameter estimated without variable selection, purely to improve mixing. I would like to know what others think about this idea, and if anyone has a citation that used this method?
--
*** Three hierarchical modeling email lists ***
(1) unmarked: for questions specific to the R package unmarked
(2) SCR: for design and Bayesian or non-bayesian analysis of spatial capture-recapture
(3) HMecology (this list): for everything else, especially material covered in the books by Royle & Dorazio (2008), Kéry & Schaub (2012), Kéry & Royle (2016, 2021) and Schaub & Kéry (2022)
---
You received this message because you are subscribed to the Google Groups "hmecology: Hierarchical Modeling in Ecology" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hmecology+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/hmecology/10423e2d-fdf0-49bc-b86e-df22fb487d15n%40googlegroups.com.
I think that restricting the posterior of an effect estimate to just the MCMC iterations when the variable was included perhaps makes sense, because then the effect is estimated over models with and without the other variables weighted by their inclusion probabilities. But I would be interested in hearing other’s thoughts on this.
Eric