Dealing with too many zero values of dependent variables

64 views
Skip to first unread message

Md. Shahin

unread,
Oct 17, 2022, 7:19:50 PM10/17/22
to lavaan
Dear Colleagues
I have designed a model, where there are two latent variables that predict two dependent variables. Socioeconomic variables are also being used as predictors along with latent variables. My problem is that there are too many zero values for dependent variables. How can I deal with this issue of zero values in SEM modelling? 
How can I interpret the results?

Your help is highly appreciated.  

Pat Malone

unread,
Oct 24, 2022, 12:29:50 PM10/24/22
to lav...@googlegroups.com
Look into "semicontinuous" or "two-part" models. You might also have a "hurdle" outcome. Those present a few options, depending in part on why you think you have so many zeroes.

--
You received this message because you are subscribed to the Google Groups "lavaan" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lavaan+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/lavaan/9d39cf66-382b-4ca0-934b-0d2dc2c55e6cn%40googlegroups.com.


--
Patrick S. Malone, PhD
Sr Research Statistician, FAR HARBΦR
This message may contain confidential information; if you are not the intended recipient please notify the sender and delete the message.

Pat Malone

unread,
Oct 24, 2022, 12:33:05 PM10/24/22
to lav...@googlegroups.com
Another search term is "zero-inflated" outcomes.

Md. Shahin

unread,
Oct 31, 2022, 7:38:18 PM10/31/22
to lav...@googlegroups.com
Hi Pat
I am new to hurdle modelling. May I do that in lavaan?
Cheers!
Md. Shahin 
Associate Professor
Department of Disaster Resilience and Engineering
Patuakhali Science and Technology University, Bangladesh.
Web site: 
https://www.pstu.ac.bd/teachers/shahin



You received this message because you are subscribed to a topic in the Google Groups "lavaan" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/lavaan/nazeqXJAAjM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to lavaan+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/lavaan/CAJhmz4f2SU5xUJ_%3DyqiA%3DzH4W14p-B%3DQxCd1DODQrfGkmmeSNw%40mail.gmail.com.

Terrence Jorgensen

unread,
Nov 7, 2022, 4:22:37 AM11/7/22
to lavaan
I am new to hurdle modelling. May I do that in lavaan?

Not in any automatic way.  There are papers showing how to hack Mplus by making 2 copies of each zero-inflated variable:
  • A binary indicator of whether it is nonzero (with NA assigned to truly missing values)
  • A continuous indicator of observed nonzero values, which is NA for any observations that are missing or 0
In the Mplus articles, they use marginal MLE to accommodate missingness with FIML, but MML estimation is not currently an option in lavaan.  With minimal (true) NAs, I think you might be able to use WLSMV or PML estimation, but I have not seen those evaluated with simulations, so I hesitate to recommend it.

Terrence D. Jorgensen
Assistant Professor, Methods and Statistics
Research Institute for Child Development and Education, the University of Amsterdam
 

Md. Shahin

unread,
Nov 7, 2022, 5:14:24 AM11/7/22
to lav...@googlegroups.com
Hi Terrence D. Jorgensen
Thanks for your suggestions. 

Cheers!
Md. Shahin 
Associate Professor
Department of Disaster Resilience and Engineering
Patuakhali Science and Technology University, Bangladesh.
Web site: 
https://www.pstu.ac.bd/teachers/shahin


--
You received this message because you are subscribed to a topic in the Google Groups "lavaan" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/lavaan/nazeqXJAAjM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to lavaan+un...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages