Hi all,
I have a question concerning prediction in lavaan. We have a CFA model with (for now) 6 latent factors and 13 indicators. We have fitted this model on a cognitively healthy sample, and then applied it to a patient sample using the predict function.
What we wanted to do next is investigate to what extent the predicted latent factor scores (from the model fitted on the healthy sample) change when data for one or more of the indicators are missing. In the future, the model will be used to estimate cognitive functioning in different domains, and it is very likely that not every person will have completed the full test battery. Thus, it is essentially a test of the robustness of the model (e.g., which tests are essential for reliable latent factor scores?).
In a first exploratory and rather crude attempt, we set all the values of one indicator to zero and fed this data frame to lavPredict. In this case, however, lavaan gives an error because that indicator has no variance. Our conception of what lavPredict does is that it estimates the factor scores on a subject-by-subject basis, which we thought should be possible in principle when all values of one or more indicators are zero (analogous to a simple regression equation, where a predictor would simply be cancelled out by multiplying its weight by zero).
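To make this conception concrete: as far as we understand it (this is our assumption about the regression method for continuous indicators, not something we have checked against lavaan's source), the factor scores for subject i would be a linear function of that subject's indicator values, with all parameters fixed at the estimates from the healthy sample:

$$
\hat{\eta}_i = \alpha + \Psi \Lambda^{\top} \Sigma^{-1} (y_i - \mu),
\qquad \Sigma = \Lambda \Psi \Lambda^{\top} + \Theta,
\qquad \mu = \nu + \Lambda \alpha
$$

Here y_i is the vector of the 13 indicator scores of subject i, Λ the loading matrix, Ψ the covariance matrix of the 6 latent factors, Θ the residual covariance matrix, ν the indicator intercepts, and α the latent means. If this is right, a zero on indicator j would not be dropped from the equation but would enter as the deviation 0 - μ_j, so our regression analogy would only hold exactly if the indicators were centered.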
My question is therefore both a theoretical and a practical one: does it make theoretical sense to carry out such an analysis, and if so, how would one go about doing this using predict or lavPredict?
Thanks in advance!