Hi,
I'm trying to wrap my head around how missing data is accounted for in blavaan when using it with Stan. Merkle et al. (2021) (
https://www.jstatsoft.org/article/view/v100i06) state blavaan is doing something similar to FIML with lavaan when handling missing values, but I just want to clarify a couple things for myself since I don't think I totally understand how this works.
- Given what Merkle et al. discuss, the posterior estimates for parameters in a blavaan model with missing data (on endogenous variables) are "accounting" for the missingness in a somewhat similar way to what lavaan does with FIML (rather than listwise deleting the data). So, for example, in a blavaan growth model, as long as a person has at least one data point, they are included in the model estimation?
- When I use blavPredict() to predict the missing values, the estimates I obtain for that reflect the posterior distribution of values for only the cells in the data frame with a value of "NA." Is that correct?
- I use brms quite a bit as well, and I'm just curious if anyone could elaborate on how the mi() function works compared to what blavaan is doing with its "full information" approach. Are these the same thing on Stan's backend, since the blavPredict function essentially produces imputations of missing values?
Thanks so much for the input and for all your work on this package!
Garret