First, I would like to express my thanks to Dr. Chalmers for his help last week.
I'm attempting to analyze dichotomous reading comprehension data (n=10,521) for DIF with 1 reference group and 5 focal groups. I am quite new to MIRT so I apologize in advance for any obvious (or subtle) errors and would appreciate your feedback.
My understanding is that as a very first step I need to determine the dimensionality of the data. To do so I ran this code (not including multipleGroup at this stage):
mod1 <- mirt(read_mirt, 1, verbose=TRUE)
mod2 <- mirt(read_mirt, 2, verbose=TRUE)
mod3 <- mirt(read_mirt, 3, verbose=TRUE)
mirt::M2(mod1)
mirt::M2(mod2)
mirt::M2(mod3)
Here are the model fit statistics produced by M2:
> mirt::M2(mod1)
M2 df p RMSEA RMSEA_5 RMSEA_95 SRMSR TLI CFI
stats 1522.236 405 0 0.01619336 0.01533095 0.01706289 0.01842013 0.9853581 0.9863679
> mirt::M2(mod2)
M2 df p RMSEA RMSEA_5 RMSEA_95 SRMSR TLI CFI
stats 1035.82 376 0 0.01291549 0.01198092 0.01385578 0.01546339 0.9906858 0.9919491
> mirt::M2(mod3)
M2 df p RMSEA RMSEA_5 RMSEA_95 SRMSR TLI CFI
stats 784.1172 348 0 0.01091451 0.009897989 0.011932 0.01353918 0.9933483 0.9946786
I've read that SRMSR less than .05 is acceptable, and that TLI and CFI greater than .95 are acceptable. (I've also heard that RMSEA itself is not ideal for model selection.) However, I'm unsure if these are cut-offs, or if relatively smaller SRMSR and greater TLI/CFI are considered better fits (along the lines of BIC/AIC interpretations). Therefore, I am not sure which of these models is a better fit.
Although I am not entirely sure this is how I should proceed, I calculated AIC and BIC using the anova method. This appears to show that 2 or 3 dimensions are preferable to 1. Comparing 2 to 3 dimensions, 3 dimensions appears preferable for all indices except non-sample size adjusted BIC.
> anova (mod1, mod2)
Model 1: mirt(data = read_mirt, model = 1, verbose = TRUE)
Model 2: mirt(data = read_mirt, model = 2, verbose = TRUE)
AIC AICc SABIC BIC logLik X2 df p
1 268046.1 268046.8 268291.1 268481.8 -133963.1 NaN NaN NaN
2 267718.3 267719.9 268081.7 268364.6 -133770.2 385.808 29 0
> anova (mod1, mod3)
Model 1: mirt(data = read_mirt, model = 1, verbose = TRUE)
Model 2: mirt(data = read_mirt, model = 3, verbose = TRUE)
AIC AICc SABIC BIC logLik X2 df p
1 268046.1 268046.8 268291.1 268481.8 -133963.1 NaN NaN NaN
2 267554.0 267556.6 268031.7 268403.5 -133660.0 606.143 57 0
> anova (mod2, mod3)
Model 1: mirt(data = read_mirt, model = 2, verbose = TRUE)
Model 2: mirt(data = read_mirt, model = 3, verbose = TRUE)
AIC AICc SABIC BIC logLik X2 df p
1 267718.3 267719.9 268081.7 268364.6 -133770.2 NaN NaN NaN
2 267554.0 267556.6 268031.7 268403.5 -133660.0 220.335 28 0
>
It appears the best-fitting IRT model has three dimensions. If the forum could provide advice about model fit, I would appreciate it very much.
(Note: is it necessary to run 4-dimensional model for comparison?)
Thank you!
Jeanne