MGCFA and parameterization

jakob h

unread,

Apr 28, 2023, 6:21:38 AM4/28/23

to lavaan

Dear all,

First of all, I'd like to thank Yves Rosseel and Terrence Jorgensen for providing the lavaan package and for maintaining this community here. I have been reading a lot of the postings in this google group and they have been tremendously helpful.

I know there have been a ton of postings about measurement invariance with categorical variables on here. Unfortunately, being rather a novice at CFA, I am struggling to make sure that I understand all the recommendations correctly. I would immensely appreciate if someone could help me out and confirm that my model specifications make sense.

So, I am using a dataset with several dozen countries and several tens of thousands of observations. I had the following simple model (Y1-4 are categorical variables with three categories):

model <- 'F1 =~ Y1 + Y2 + Y3 + Y4'
fit<- cfa(model, data=data, ordered = c("Y1", "Y2", "Y3", "Y4"), std.lv = T, parameterization = "delta", estimator = "WLSMV")

The overall fit of this model was not very good, especially the robust RMSEA was 0.160 (I understand from this conversation - https://groups.google.com/g/lavaan/c/pfVem3X_N9A/m/INnOcZWSAgAJ - that the robust RMSEA is the only relevant one for WLSMV). When I tried to test for (configural) measurement invariance, I ran this model in each country separately and found the fit measures to be even worse in quite a lot of countries.

Therefore, I looked at the modification indices and found that several were quite high. Y3 ~~ Y4 was the highest (very large actually, but I assumed this is not worrisome because my n is very large). I know there is some discussion on whether it is okay to respecify the model just on the basic of the modification indices - but since the two variables are measured on the same scale (they are both "agree"/"hard to say"/"disagree"), I thought I could free this parameter and argue that the two variables have a similar type of measurement error. Here are my two first questions:

1. Is this a sound way of proceeding? Especially since
2. Y1 and Y2 also have the same scale as Y3 and Y4? Should I therefore also free Y1 ~~ Y2?

Sticking with Y3 ~~ Y4 for the moment, my next question is:

3. Do I understand correctly that I cannot include this while using delta parameterization and that I need to switch to theta parameterization? Hence my code would be:
model <- 'F1 =~ Y1 + Y2 + Y3 + Y4
Y3 ~~ Y4'
fit<- cfa(model, data=data, ordered = c("Y1", "Y2", "Y3", "Y4"), std.lv = T, parameterization = "theta", estimator = "WLSMV")

Is this correct?

As I move on to test measurement invariance, I had wanted to use the recommendations given by Wu & Estabrook (2016) using the paper by Svetina et al. (Svetina, D., Rutkowski, L., & Rutkowski, D. (2020). Multiple-Group Invariance with Categorical Outcomes Using Updated Guidelines: An Illustration Using M plus and the lavaan/semTools Packages. Structural Equation Modeling: A Multidisciplinary Journal, 27(1), 111–130. https://doi.org/10.1080/10705511.2019.1602776). However,

4. Am I correct in assuming that this is no longer possible given that I use theta parameterization? Do I need to I go with the Millsap & Tein (2004) procedure?

Any advice would be greatly appreciated! Thank you so much for your time.

Terrence Jorgensen

unread,

May 12, 2023, 6:03:38 AM5/12/23

to lavaan

since the two variables are measured on the same scale (they are both "agree"/"hard to say"/"disagree"), I thought I could free this parameter and argue that the two variables have a similar type of measurement error. Here are my two first questions:

1. Is this a sound way of proceeding?

The problem with using modification indices to make your decision is that your decision is driven by your sample data rather than theory. A model represents your theory, so modifications should make sense. Modification indices can be useful to draw your attention, but they can also be ridiculous, so consider your item content.

2. Y1 and Y2 also have the same scale as Y3 and Y4? Should I therefore also free Y1 ~~ Y2?

By "scale", do you mean the same response options? There could indeed be method effects, but your configural model will be saturated if each pair of indicators has a residual correlation. That could very well be fine. It would be analogous to having a 3-indicator factor, which also has a saturated configural model. The metric invariance model's test statistic would then be the same as comparing it to the configural model, because the configural model's chi-squared and df = 0.

Sticking with Y3 ~~ Y4 for the moment, my next question is:

3. Do I understand correctly that I cannot include this while using delta parameterization and that I need to switch to theta parameterization?

Residual covariances are estimable in either parameterization. The reason to use theta parameterization is if you want to test strict invariance (equality of residual variances).

4. Am I correct in assuming that this is no longer possible given that I use theta parameterization?

I believe that is the parameterization recommended.

Do I need to I go with the Millsap & Tein (2004) procedure?

No, never.

Terrence D. Jorgensen (he, him, his)
Assistant Professor, Methods and Statistics
Research Institute for Child Development and Education, the University of Amsterdam
http://www.uva.nl/profile/t.d.jorgensen

jakob h

unread,

Jun 20, 2023, 3:31:22 PM6/20/23

to lavaan

Dear Terrence,

My sincere apologies for my late response and thank you for taking the time to answer my questions, this is extremely helpful! I really appreciate your help!

Reply all

Reply to author

Forward