simsem: Problems to conduct simulation study with missing structure and using ML family estimator

110 views
Skip to first unread message

Jaakko Tammilehto

unread,
Jul 14, 2019, 1:21:09 PM7/14/19
to lavaan
Hi,

I have recently worked with simsem that is a great R package. I utilized lavaan syntax to form population and analysis models and then conducted MC simulation with 1000 replications using MLR as an estimator (which is my plan also in the actual study). The basic simulations without specifying the missing structure worked just fine.  However, more recently, I have tried to take into account the known missing structure in these simulations. I have used both miss(logit=script, m=0) as well as miss(pmMCAR=0.3, m=0) functions to define the missing structure (see R script below). For some reason, conducting MC simulations is not possible if the ML family estimator is used. The error message is as follow: 

Progress: 1 / 1000 
Error in lav_options_set(opt) : 
 lavaan ERROR: estimator ML for ordered data is not supported yet. Use WLSMV instead.


Using WLSMV as an estimator, I manage to run the MC simulations; however, as I did not specify the thresholds for the indicators (they should be handled as continuous), I got following warning message:

1: In lav_partable_check(lavpartable, categorical = lavoptions$categorical,  : lavaan WARNING: parameter table does not contain thresholds 


So, the questions that I have wondered without solution: 

1. Is it really so that it is not possible to conduct MC simulation with missing structure when using ML family estimator? When I tested this with this example https://github.com/simsem/simsem/blob/master/SupportingDocs/Examples/Version05/ex8/ex8.R I manage the run simulation with ML estimator. 

2. Or is the problem that data is handled as ordered? If so, how can I tell lavaan that all (endogenous) variables are continuous (I assumed that this should be automatic and lavaan only needs to know what variables are categorical (ordered=))? Or is there some other problem in my code?

Thank you for your help!

Best regards,
Jaakko Tammilehto

Here is the code:

#Used packages
install.packages("lavaan")
install.packages("simsem")
install.packages("simstandard")

library("lavaan")
library("simsem")
library("simstandard")

#POPULATION MODEL
AUTmodelPOP.CHILDHOOD<-'
#MOTHER-CHILD RELATIONSHIP AUTONOMY in infancy, middle childhood, and late adolescence
MAUTONOMY1=~0.8*maut1_1+0.7*maut1_2+0.8*maut1_3+0.7*maut1_4 
MAUTONOMY2=~0.8*maut2_1+0.7*maut2_2+0.8*maut2_3+0.7*maut2_4
MAUTONOMY3=~0.8*maut3_1+0.7*maut3_2+0.8*maut3_3+0.7*maut3_4

#FATHER-CHILD RELATIONSHIP AUTONOMY in infancy, middle childhood, and late adolescence
PAUTONOMY1=~0.7*paut1_1+0.8*paut1_2+0.7*paut1_3+0.8*paut1_4
PAUTONOMY2=~0.7*paut2_1+0.8*paut2_2+0.7*paut2_3+0.8*paut2_4
PAUTONOMY3=~0.7*paut3_1+0.8*paut3_2+0.7*paut3_3+0.8*paut3_4

#cross-sectional correlations between MOTHER-CHILD and FATHER-CHILD RELATIONSHIP AUTONOMY
MAUTONOMY1~~0.2*PAUTONOMY1
MAUTONOMY2~~0.15*PAUTONOMY2
MAUTONOMY3~~0.1*PAUTONOMY3

REAPPRAISAL=~1*REAPPRAISAL1
SUPPRESSION=~1*SUPPRESSION1
RUMINATION=~1*RUMINATION1

RUMINATION1~~0.20*RUMINATION1
REAPPRAISAL1~~0.25*REAPPRAISAL1
SUPPRESSION1~~0.25*SUPPRESSION1

#error correlations

maut1_1~~.07*maut2_1
maut1_1~~.03*maut3_1
maut2_1~~.07*maut3_1

maut1_2~~.07*maut2_2
maut1_2~~.03*maut3_2
maut2_2~~.07*maut3_2

maut1_3~~.07*maut2_3
maut1_3~~.03*maut3_3
maut2_3~~.07*maut3_3

maut1_4~~.07*maut2_4
maut1_4~~.03*maut3_4
maut2_4~~.07*maut3_4

paut1_1~~.07*paut2_1
paut1_1~~.03*paut3_1
paut2_1~~.07*paut3_1

paut1_2~~.07*paut2_2
paut1_2~~.03*paut3_2
paut2_2~~.07*paut3_2

paut1_3~~.07*paut2_3
paut1_3~~.03*paut3_3
paut2_3~~.07*paut3_3

paut1_4~~.07*paut2_4
paut1_4~~.03*paut3_4
paut2_4~~.07*paut3_4

#path coefficients
PAUTONOMY2~0.4*PAUTONOMY1+0.15*MAUTONOMY1+0*SEX+0.1*HOITO+0.2*SES
MAUTONOMY2~0.15*PAUTONOMY1+0.4*MAUTONOMY1+0*SEX+0.1*HOITO+0.2*SES
PAUTONOMY3~0.35*PAUTONOMY2+0.15*MAUTONOMY2+0*SEX+0*HOITO+0.1*SES
MAUTONOMY3~0.15*PAUTONOMY2+0.35*MAUTONOMY2+0*SEX+0*HOITO+0.1*SES

REAPPRAISAL~0.13*PAUTONOMY1+0.13*MAUTONOMY1+0.13*PAUTONOMY2+0.13*MAUTONOMY2+0.13*PAUTONOMY3+0.13*MAUTONOMY3+0*SEX+0*HOITO+0.1*SES
SUPPRESSION~(-0.13)*PAUTONOMY1+(-0.13)*MAUTONOMY1+(-0.13)*PAUTONOMY2+(-0.13)*MAUTONOMY2+(-0.13)*PAUTONOMY3+(-0.13)*MAUTONOMY3+(-0.2)*SEX+0*HOITO+(-0.1)*SES
RUMINATION~(-0.13)*PAUTONOMY1+(-0.13)*MAUTONOMY1+(-0.13)*PAUTONOMY2+(-0.13)*MAUTONOMY2+(-0.13)*PAUTONOMY3+(-0.13)*MAUTONOMY3+0.2*SEX+0*HOITO+(-0.1)*SES

#correlations between ER strategies
REAPPRAISAL~~(0.0)*SUPPRESSION
REAPPRAISAL~~(-0.1)*RUMINATION
SUPPRESSION~~0.1*RUMINATION

#correlations between covariates
SES~~0*HOITO
SEX~~0*SES
SEX~~0*HOITO
# Variances
maut1_1 ~~ 0.36 * maut1_1
maut2_1 ~~ 0.36 * maut2_1
maut1_2 ~~ 0.51 * maut1_2
maut2_2 ~~ 0.51 * maut2_2
maut1_3 ~~ 0.36 * maut1_3
maut2_3 ~~ 0.36 * maut2_3
maut1_4 ~~ 0.51 * maut1_4
maut2_4 ~~ 0.51 * maut2_4
paut1_1 ~~ 0.51 * paut1_1
paut2_1 ~~ 0.51 * paut2_1
paut1_2 ~~ 0.36 * paut1_2
paut2_2 ~~ 0.36 * paut2_2
paut1_3 ~~ 0.51 * paut1_3
paut2_3 ~~ 0.51 * paut2_3
paut1_4 ~~ 0.36 * paut1_4
paut2_4 ~~ 0.36 * paut2_4
REAPPRAISAL ~~ 0.7273162365475 * REAPPRAISAL
SUPPRESSION ~~ 0.6873162365475 * SUPPRESSION
RUMINATION ~~ 0.6873162365475 * RUMINATION
HOITO ~~ 1 * HOITO
SEX ~~ 1 * SEX
maut3_1 ~~ 0.36 * maut3_1
maut3_2 ~~ 0.51 * maut3_2
maut3_3 ~~ 0.36 * maut3_3
maut3_4 ~~ 0.51 * maut3_4
paut3_1 ~~ 0.51 * paut3_1
paut3_2 ~~ 0.36 * paut3_2
paut3_3 ~~ 0.51 * paut3_3
paut3_4 ~~ 0.36 * paut3_4
SES ~~ 1 * SES
MAUTONOMY1 ~~ 1 * MAUTONOMY1
MAUTONOMY2 ~~ 0.7435 * MAUTONOMY2
MAUTONOMY3 ~~ 0.791607375 * MAUTONOMY3
PAUTONOMY1 ~~ 1 * PAUTONOMY1
PAUTONOMY2 ~~ 0.7435 * PAUTONOMY2
PAUTONOMY3 ~~ 0.791607375 * PAUTONOMY3
'

#calculating variances for standardized parameters (variances have been added to model syntax)
m_complete <- model_complete(AUTmodelPOP.CHILDHOOD)
cat(m_complete)


#ANALYSIS MODEL
AUTmodel.CHILDHOOD<-'
#MOTHER-CHILD RELATIONSHIP AUTONOMY in infancy, middle childhood, and late adolescence
MAUTONOMY1=~maut1_1+maut1_2+maut1_3+maut1_4
MAUTONOMY2=~maut2_1+maut2_2+maut2_3+maut2_4
MAUTONOMY3=~maut3_1+maut3_2+maut3_3+maut3_4

#FATHER-CHILD RELATIONSHIP AUTONOMY in infancy, middle childhood, and late adolescence
PAUTONOMY1=~paut1_1+paut1_2+paut1_3+paut1_4
PAUTONOMY2=~paut2_1+paut2_2+paut2_3+paut2_4
PAUTONOMY3=~paut3_1+paut3_2+paut3_3+paut3_4

REAPPRAISAL=~1*REAPPRAISAL1
SUPPRESSION=~1*SUPPRESSION1
RUMINATION=~1*RUMINATION1

RUMINATION1~~0.20*RUMINATION1
REAPPRAISAL1~~0.25*REAPPRAISAL1
SUPPRESSION1~~0.25*SUPPRESSION1

#error correlations
maut1_1~~maut2_1
maut1_1~~maut3_1
maut2_1~~maut3_1

maut1_2~~maut2_2
maut1_2~~maut3_2
maut2_2~~maut3_2

maut1_3~~maut2_3
maut1_3~~maut3_3
maut2_3~~maut3_3

maut1_4~~maut2_4
maut1_4~~maut3_4
maut2_4~~maut3_4

paut1_1~~paut2_1
paut1_1~~paut3_1
paut2_1~~paut3_1

paut1_2~~paut2_2
paut1_2~~paut3_2
paut2_2~~paut3_2

paut1_3~~paut2_3
paut1_3~~paut3_3
paut2_3~~paut3_3

paut1_4~~paut2_4
paut1_4~~paut3_4
paut2_4~~paut3_4

#cross-sectional correlations between MOTHER-CHILD AND FATHER-CHILD RELATIONSHIP AUTONOMY
MAUTONOMY1~~PAUTONOMY1
MAUTONOMY2~~PAUTONOMY2
MAUTONOMY3~~PAUTONOMY3

#path coefficients
PAUTONOMY2~PA1*PAUTONOMY1+MA1*MAUTONOMY1+SEX+HOITO+SES
MAUTONOMY2~PA2*PAUTONOMY1+MA2*MAUTONOMY1+SEX+HOITO+SES
PAUTONOMY3~PA3*PAUTONOMY2+MA3*MAUTONOMY2+SEX+HOITO+SES
MAUTONOMY3~PA4*PAUTONOMY2+MA4*MAUTONOMY2+SEX+HOITO+SES

REAPPRAISAL~PARE1*PAUTONOMY1+MARE1*MAUTONOMY1+PARE2*PAUTONOMY2+MARE2*MAUTONOMY2+PARE3*PAUTONOMY3+MARE3*MAUTONOMY3+SEX+HOITO+SES
SUPPRESSION~PASU1*PAUTONOMY1+MASU1*MAUTONOMY1+PASU2*PAUTONOMY2+MASU2*MAUTONOMY2+PASU3*PAUTONOMY3+MASU3*MAUTONOMY3+SEX+HOITO+SES
RUMINATION~ PARU1*PAUTONOMY1+MARU1*MAUTONOMY1+PARU2*PAUTONOMY2+MARU2*MAUTONOMY2+PARU3*PAUTONOMY3+MARU3*MAUTONOMY3+SEX+HOITO+SES

#correlation between ER strategies
REAPPRAISAL ~~ SUPPRESSION
REAPPRAISAL ~~ RUMINATION
SUPPRESSION ~~ RUMINATION
'

#missing data structure
script<-'maut1_1~p(0)
maut1_2~p(0.0)
maut1_3~p(0.0)
maut1_4~p(0.0)
maut2_1~p(0.112)
maut2_2~p(0.112)
maut2_3~p(0.112)
maut2_4~p(0.112)
maut3_1~p(0.174)
maut3_2~p(0.174)
maut3_3~p(0.174)
maut3_4~p(0.174)
paut1_1~p(0.073)
paut1_2~p(0.073)
paut1_3~p(0.073)
paut1_4~p(0.073)
paut2_1~p(0.452)
paut2_2~p(0.452)
paut2_3~p(0.452)
paut2_4~p(0.452)
paut3_1~p(0.346)
paut3_2~p(0.346)
paut3_3~p(0.346)
paut3_4~p(0.346)
REAPPRAISAL1~p(0.196)
SUPPRESSION1~p(0.196)
RUMINATION1~p(0.196)
SEX~p(0.0)
HOITO~p(0.0)
SES~p(0.0)'

missingness<-miss(logit=script, m=0)

AUTanalysis.POPCHILDHOOD.CHILDHOOD <-sim(1000, n =546, model=AUTmodel.CHILDHOOD, seed=565, generate=AUTmodelPOP.CHILDHOOD, lavaanfun = "sem", missingness, std.lv=T, cilevel=0.90, estimator="MLR")

Progress: 1 / 1000 
Error in lav_options_set(opt) : 
  lavaan ERROR: estimator ML for ordered data is not supported yet. Use WLSMV instead.
Progress: 2 / 1000 
Error in lav_options_set(opt) : 
  lavaan ERROR: estimator ML for ordered data is not supported yet. Use WLSMV instead.

#the same thing with using MCAR
miss.model <- miss(pmMCAR=0.3, m=0)

AUTanalysis.POPCHILDHOOD.CHILDHOOD <-sim(1000, n =546, model=AUTmodel.CHILDHOOD, seed=565, generate=AUTmodelPOP.CHILDHOOD, lavaanfun = "sem", miss.model, std.lv=T, cilevel=0.90, estimator="MLR")

Progress: 1 / 1000 
Error in lav_options_set(opt) : 
  lavaan ERROR: estimator ML for ordered data is not supported yet. Use WLSMV instead.
Progress: 2 / 1000 
Error in lav_options_set(opt) : 
  lavaan ERROR: estimator ML for ordered data is not supported yet. Use WLSMV instead.

Alex Schoemann

unread,
Jul 14, 2019, 8:50:33 PM7/14/19
to lavaan
Hi Jaakko,

This is a bit odd. Looking over your code, the first thing I'd check is how you're calling the missing data object in the sim function. You don't include the name of the argument for either of the missing data templates (missingness or miss.model), thus I'm not sure exactly which argument the missing data information is being passed to.  I'd add miss = missingness (or miss.model) to your sim command and see if that fixes things.

Alex

Jaakko Tammilehto

unread,
Jul 15, 2019, 1:07:35 AM7/15/19
to lav...@googlegroups.com
Thank you for your help Alex,

It seems to help! However, at the same time, the simulation is really slow (i.e., 4 replication in 5 minutes). But perhaps this is because of the model complexity, and there is nothing to do about it.

Best,
Jaakko
--
You received this message because you are subscribed to the Google Groups "lavaan" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lavaan+un...@googlegroups.com.
To post to this group, send email to lav...@googlegroups.com.
Visit this group at https://groups.google.com/group/lavaan.
To view this discussion on the web visit https://groups.google.com/d/msgid/lavaan/ce9564b9-3f54-486b-9b92-7b068b201d42%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Alex Schoemann

unread,
Jul 16, 2019, 9:08:07 AM7/16/19
to lavaan
Hmmm, that does sound slow. It's possible that everything is working fine and that's what's happening, but usually if things are going that slowly it implies that there are convergence difficulties with each model. I'd recommend running a small number of replications (e.g 5-10) and looking at the results before diving into a full 1000. That way if there's something wonky with the models that is causing convergence issues you can diagnose it more quickly.

Alex
To unsubscribe from this group and stop receiving emails from it, send an email to lav...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages