Il 16/10/2013 00:32, Michael Betancourt ha scritto:
> The narrow spikes are actually good! Hierarchical models tend to have fat tails, and that spiking behavior is exactly
> what you want to see when sampling fat tails efficiently.
Good. But what about the pairs() output?
The off-diagonal plots (file pairs.png attached) look, how could I say,
somewhat strange-shaped to me.
> You can probably stick to a higher delta and leave epsilon_pm at zero.
I've tried. You are right. Of course ;-)
However, if I set delta = 0.99 and leave epsilon_pm at zero (and probs =
c(0.025, 0.5, 0.975)) I get:
mu_a 1.14 0.02 1.05 -0.81 1.11 3.28 4062 1
sigma_a 1.98 0.03 1.37 0.80 1.62 5.48 2517 1
i.e. mu_a in (-0.81, 3.28), sigma_a in (0.80, 5.48).
Setting epsilon_pm = 1 and delta = 0.9:
mu_a 1.12 0.02 0.94 -0.81 1.12 3.09 2597 1
sigma_a 1.91 0.03 1.13 0.79 1.60 4.95 1752 1
that looks a bit better. Just by chance?
> In fact, if you really want to find the optimal
> performance vary delta between 0.6 and 0.99 and record the inferences (mean+/- std-dev or, even better, the percentiles)
> of sigma. You should see the inferences stabilize at some value of delta (i.e. they're constant for all greater values of delta) --
> that will be the optimal value to use.
I'm playing with a lot of toy models because I've to work on a tough
real model: a multilevel logistic non-nested model, two 10x10 covariance
matrices, just 25000 observations. A nightmare.
Looking for the optimal value of delta could take a long time...
What if I just try 0.99? Any problems?
> Also keep an eye out on the tree_depth. If it starts to push up against the default max you'll have to increase the max_depth
> yourself to maintain efficient sampling.
May I say that get_sampler_params() isn't that convenient? ;-)
I've written a plot_sampler_params() function (code and plots attached).
Yes, looking at the plots it looks as if I really should increase
max_treedepth.
I'll try.
Thanks again!
Sergio