Should we create a separate repo for problematic model/data + sampling
configs?
However you'd like to structure it is OK with me.
Do you want to set it up? I can set it up.
Aki
I don't know if we need a separate repo, but there is no great shortage of flaky BUGS examples. All of these were for 21 chains with a base seed of 12345 that is advanced by the chain_id but otherwise kept the default settings
Stan doesn't parse: fire
Stan blows up: salm2, salm, orange
Stan takes way too long: schools (not eight_schools)
Stan doesn't converge: hepatitisME, hepatitis, air, eyes
Stan has bad mixing at a minimum: blocker, kidney, leukfr, magnesium, seeds, birats, stagnant
In each of these cases (except fire), the model wasn't much better with JAGS, so I think it is more a situation where the models are bad (gamma(0,0) priors, etc.). And it underscores that these BUGS examples are not good for unit-testing, but it would be nice if Stan could do a little better.
library(rstan)
blocker <- stan_demo("blocker", seed = 12345, chains = 21, iter = 10000)
> summary(summary(blocker)$summary[,"Rhat"])
Min. 1st Qu. Median Mean 3rd Qu. Max.
1.000 1.001 1.001 1.002 1.002 1.010
pairs(blocker, pars = c("d", "delta_new", "sigma_delta", "lp__"), chain_id = list(6, c(1:5,7:21)))
> convergence(blocker, thin = 25, R = 199)
[1] "Multivariate loss = 1.004"
disco(x = mat, factors = chain_id, distance = FALSE, R = R)
Distance Components: index 1.00
Source Df Sum Dist Mean Dist F-ratio p-value
factors 41 543.49225 13.25591 1.865 0.005
Within 4158 29555.97387 7.10822
Total 4199 30099.46612