Bias corrected CI for mediation with multiply imputed data (SEM.MI vs. FIML)?

167 views

Skip to first unread message

zrc2...@gmail.com

unread,

Feb 12, 2019, 7:13:58 PM2/12/19

to lavaan

Hello,

I am using the "sem.mi" function to run a series of mediation models for imputed data. Importantly, I used mice to impute the entire dataset first and ran models based on the imputed data. I hope to get the bias corrected ci's and was not sure how to get the complete set of results you'd get from "parameterEstimates". Thank you in advance for any help/insight you can offer.

(1) MI dataset

HSMiss <- HolzingerSwineford1939[ , c(paste("x", 1:9, sep = ""),"ageyr","agemo","school")]

impHS=mice(HolzingerSwineford1939, m=20, seed=123, print=TRUE)

imps=impHS$imputations

View(HolzingerSwineford1939

+ )

HS.model <- ' # direct effect

+ x9 ~ c*x3

+ # mediator

+ x6 ~ a*x3

+ x9 ~ b*x6

+ # indirect effect (a*b)

+ ab := a*b

+ # total effect

+ total := c + (a*b)

+ '

out2=sem.mi(HS.model, data=impHS)

summary(out2)

THIS IS ALL THAT WAS PRODUCED FOR MI DATASET

parameterEstimates(out2, boot.ci.type = "bca.simple", standardized = TRUE)

lhs op rhs label std.lv std.all std.nox

1 x9 ~ x3 c 0.094 0.106 0.094

2 x6 ~ x3 a 0.060 0.062 0.055

3 x9 ~ x6 b 0.033 0.036 0.036

4 x9 ~~ x9 0.472 0.465 0.465

5 x6 ~~ x6 0.606 0.506 0.506

6 x3 ~~ x3 0.669 0.524 0.669

7 ab := a*b ab 0.002 0.002 0.002

8 total := c+(a*b) total 0.096 0.108 0.096

(2) FIML

targetFit=sem(HS.model, HSMiss, meanstructure=TRUE, std.lv=TRUE, missing="fiml",se="bootstrap", bootstrap=5000)

summary(targetFit, rsquare=TRUE, standardized=TRUE)

HERE, I GET CI's and other output along with it when I use the same command

> parameterEstimates(targetFit, boot.ci.type = "bca.simple", standardized = TRUE)

lhs op rhs label est se z pvalue ci.lower ci.upper std.lv std.all std.nox

1 x9 ~ x3 c 0.266 0.047 5.651 0.000 0.176 0.361 0.266 0.298 0.264

2 x6 ~ x3 a 0.191 0.060 3.201 0.001 0.077 0.313 0.191 0.198 0.175

3 x9 ~ x6 b 0.143 0.055 2.603 0.009 0.037 0.253 0.143 0.155 0.155

4 x9 ~~ x9 0.882 0.071 12.499 0.000 0.762 1.047 0.882 0.869 0.869

5 x6 ~~ x6 1.150 0.105 10.904 0.000 0.962 1.378 1.150 0.961 0.961

6 x3 ~~ x3 1.275 0.000 NA NA 1.275 1.275 1.275 1.000 1.275

7 x9 ~1 4.463 0.161 27.672 0.000 4.142 4.771 4.463 4.430 4.430

8 x6 ~1 1.755 0.128 13.686 0.000 1.496 2.004 1.755 1.604 1.604

9 x3 ~1 2.250 0.000 NA NA 2.250 2.250 2.250 1.993 2.250

10 ab := a*b ab 0.027 0.015 1.812 0.070 0.006 0.068 0.027 0.031 0.027

11 total := c+(a*b) total 0.293 0.049 6.043 0.000 0.201 0.391 0.293 0.329 0.291

Mauricio Garnier-Villarreal

unread,

Feb 12, 2019, 11:54:30 PM2/12/19

to lavaan

I am not sure the lavaan.mi object can report the pooled bootstrap CI. Also, this would take a long time, as you would need to run the 2000 bootstrap (for example) for each imputed data set.

I would recommend to use the MonteCarlo sampling method, here is an example on how to do this with MI

# impose missing data for example

HSMiss <- HolzingerSwineford1939[ , c(paste("x", 1:9, sep = ""),
"ageyr","agemo","school")]

set.seed(12345)
HSMiss$x5 <- ifelse(HSMiss$x5 <= quantile(HSMiss$x5, .3), NA, HSMiss$x5)
age <- HSMiss$ageyr + HSMiss$agemo/12
HSMiss$x9 <- ifelse(age <= quantile(age, .3), NA, HSMiss$x9)

## specify CFA model from lavaan's ?cfa help page
HS.model <- '
visual =~ x1 + x2 + x3
textual =~ x4 + x5 + x6
speed =~ x7 + x8 + x9

textual ~ a*visual
speed ~ cp*visual + b*textual
ind := a*b
'

## ... or impute missing data first
library(Amelia)
set.seed(12345)
HS.amelia <- amelia(HSMiss, m = 20, noms = "school", p2s = FALSE)
imps <- HS.amelia$imputations

out2 <- cfa.mi(HS.model, data = imps, std.lv=T, verbose=T)
sm <- data.frame(summary(out2, ci=T))

abs <- sm[sm$label %in%c("a","b") ,"est"] ## save the parameter estimates a and b

ACM <- vcov(out2, type="pooled")[c("a","b"),c("a","b")] # get the pooled coavriance matrix of the parameters

monteCarloMed('a*b', coef1=abs[1], coef2=abs[2], ACM=ACM) ## estimate the montecarlo sampling method

Reply all

Reply to author

Forward

0 new messages