There are no mechanisms for categorical sampling in Stan, but
this may not be an issue for you if the categorical part is just data,
which we can handle.
We also don't implement Dirichlet processes or any other Bayesian
non-parametric method that potentially relies on varying numbers
of parameters (such as Bayesian additive regression trees, aka BART).
Our samplers assume a fixed parameter space and although we'd like
to be able to do more, we don't have any plans to expand this part
of Stan any time soon.
Sometimes you can approximate a Dirichlet process with a simple
Dirichlet of high enough dimension. Don't know if that'll work in
this problem or not.
And any mixture model is going to run into label-switching problems,
which you need to be aware of when looking at things like the
R-hat convergence diagnostics, which no longer work. As far as I
know, it's an open research problem to measure convergence in these
settings.
Another huge problem is that these mixture models can be very multi-modal,
which is also something Stan's not good at sampling through (and no system
is good at sampling through if the combinatorics are bad enough, as they are
in say, latent Dirichlet allocation).
- Bob
> --
> You received this message because you are subscribed to the Google Groups "stan users mailing list" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
stan-users+...@googlegroups.com.
> For more options, visit
https://groups.google.com/groups/opt_out.