Do we use the same covariance matrix SIGMA (for the distribution over
y) for each state?
Or do we use a different covariance matrix (say SIGMA_i) for each
state. E.g. with SIGMA_1 = sigma_1^2 * I, SIGMA_2 = sigma_2^2 * I,.....
-g
For part b, should I calculate the expected log-likelihood or just the
log-likelihood?
On Nov 5, 3:19 am, "gregory.vali...@gmail.com"
> > state. E.g. with SIGMA_1 = sigma_1^2 * I, SIGMA_2 = sigma_2^2 * I,.....- Hide quoted text -
>
> - Show quoted text -
A component density corresponds to the distribution p(data point |
latent state)
> For part b, should I calculate the expected log-likelihood or just the
> log-likelihood?
You don't need to compute it - just optimize it, which is gotten by
moment matching.
-Percy
Do you mean you want the optimized hidden variables or are the optimal
model parameters found through EM enough?