In Stan 2.10 (or maybe it's just rstan 2.10), when you print out the model fit it gives a paragraph about The estimated Bayesian Fraction of Missing Information, along with numbers such as .7 or .9 for each chain.
I have a couple questions here. First, what is this?
Second, is it a good idea for this to be in the default output? This seems to be just one more way to get users upset for no reason. Isn't it enough that we have R-hat?
--
You received this message because you are subscribed to the Google Groups "stan development mailing list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to stan-dev+u...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi, it sounds to me that this should be one of the things saved in the Stan object but that it should not be printed by default. I think that printing it by default will have the net result of confusing most users, and perhaps deterring some people from using Stan. (This relates to Bob's point that some users are intimidated by Stan's frequent issuing of warning messages.
I agree about the warnings about divergent transitions and treedepth. If R-hat indicates poor mixing, then it can be useful to hear about divergent transitions and treedepth. If mixing is ok, I don't see the advantage of bothering people about transitions and treedepth.
This new thing I really hate because it always appears, it takes up several lines of output, and it always looks bad. We're just asking users to say, "Hey, what's wrong, how come these numbers aren't 1, like they should be?"I think it's fine for this stuff to be available but I think it's bad news for it to be always output. We're making Stan look less reliable than it is.
On Jul 20, 2016, at 10:02 PM, Ben Goodrich <goodri...@gmail.com> wrote:On Wednesday, July 20, 2016 at 9:32:02 PM UTC-4, Andrew Gelman wrote:I agree about the warnings about divergent transitions and treedepth. If R-hat indicates poor mixing, then it can be useful to hear about divergent transitions and treedepth. If mixing is ok, I don't see the advantage of bothering people about transitions and treedepth.
I think that is not correct. If there are divergent transitions, leapfrogs hitting the maximum treedepth, low BFMI, etc. then the R-hat can be close to 1 without the chain(s) having explored the whole posterior.
I'm not a fan of making users look at (and understand) several different things that collectively speak to the dependence between draws when it is easy to just plot the dependence between draws.
This new thing I really hate because it always appears, it takes up several lines of output, and it always looks bad. We're just asking users to say, "Hey, what's wrong, how come these numbers aren't 1, like they should be?"I think it's fine for this stuff to be available but I think it's bad news for it to be always output. We're making Stan look less reliable than it is.
I don't think we want users to presume that models they write will yield good draws from the posterior. New Stan users usually are writing models that don't yield good draws from the posterior.
Andrew and Michael and Ben:
Can you construct a a few examples that manifest these
problems concretely (things that split R-hat diagnoses
but R-hat doesn't, that Michael's metric diagnoses, but
R-hat doesn't, etc.)? I think that'd be a huge help for
both our users and the field as a whole.
It would be really great to have Stan being able to run in a "quiet" mode which only spits out messages when there is a really bad problem.
Having exposed people in industry to Stan, I can assure you that these Warnings which come out of Stan very often scare people, even though they should just ignore it. I know that education here is the best thing, but not all people who want (or should) use Stan have the time to go into such depth; still MCMC is valuable to their work.
There are just too many false positives right now - a mode which would allow for some false negatives would be beneficial (I am not saying to make this a default!).
Sebastian
To put this into perspective, if we get ANY warning here during our analyses, then this leads to delays as we always have to explain that stuff. This is the reason why SAS by default will never say anything unless it has to STOP for some huge problem (of course, reporting level can be changed).
Sebastian
Just to emphasize - I like the warnings, I just would love to have the option to turn them off or get fewer of them...
To put this into perspective, if we get ANY warning here during our analyses, then this leads to delays as we always have to explain that stuff.
To put this into perspective, if we get ANY warning here during our analyses, then this leads to delays as we always have to explain that stuff.
I think that is the appropriate policy, although I can imagine it is frustrating when the person you have to explain it to does not have much background in the subject.
To put it another way, had we had this conversation a few months earlier, and Bob had said, "we should leave the diagnostics on by default," these diagnostics would not have included "The estimated Bayesian Fraction of Missing Information." I very strongly oppose this ratcheting in which more and more warnings get spewed out, thus giving people the idea that Stan is fragile.
Hi, yes, I agree that I would not stop when R-hat is 1.2. I'm just curious: in this particular example are there some parameters where the inferences are way off?
When I say "inferences are way off," I didn't mean relative to true values, I meant relative to the posterior distribution.
To be clear, outside of R-hat there aren’t really any useful MCMC
diagnostics.