> On Jun 18, 2016, at 9:54 AM, Charles Driver <
cdriv...@gmail.com> wrote:
>
>
> If everything's working, the adaptation in all chains should
> hit roughly the same step size and mass matrix. Getting to the
> right answer faster is better.
>
>
> Right yes. I guess this is a good indicator that even though the chains looked ok to my eye, the adaptation parameters on the slow version were not - there were quite some differences in step size.
Were you getting divergences? That can indicate that
the step size is too large. Otherwise, it should be OK.
>
> It is possible to adapt too quickly and get stuck outside
> of the typical set you want to visit for sampling.
>
> But do such cases generate otherwise healthy looking output? I would find that surprising and a little worrying!
They could, but typically won't for HMC. What will happen
is that there will be divergences or bad mixing when
starting from diffuse starting points.
> So we tend to
> be conservative in our approach in the hopes that more models adapt
> properly.
>
> I agree that increasing robustness at possible cost of speed is a reasonable approach for defaults, within limits of course :) Though given the results here, I do wonder how many other 'slow hierarchical model' questions are in part a result of this. I was dropping parameters from the model to deal with it before stumbling on this issue...
Certainly something we could look into. As Krzysztof said,
it'd help if you can share the example.
> > The manual speaks of 'expanding windows' during slow adaptation - does this mean that if I specify a window of 5, that this is the first window and later windows are larger? how much larger?
>
> Yes. They double each time up until the final window.
>
> Great, then it seems so long as sampling reaches the typical set, there should be no concerns about having started with a low value...
No, it'll wind up with a winow of roughly half the warmup iterations.
The main problem that arises is if one chain doesn't hit the proper
mean and variance, and R-hat will usually pick that up if you have diffuse
inits.
- Bob