Hello everyone,
I am currently having issues with parameter estimates for a latent factor with only 2 indicators and was wondering if any one has encountered a similar issue previously and knows of any potential solutions.
I am trying to test a 2 factor model of the big-5 personality traits that has been described previously in the psychological literature to see if the same model would apply to the genetic correlations between the traits. The problem is that one of these higher order factors are defined as the shared variance between openness and extraversion (only two indicators). The factor for the other 3 traits seems hold up well across all models I have tested below.
The genetic correlation matrix produced by LDSC regression suggests that this higher factor structure may also hold up genetically as well, extraversion and openness are correlated at 0.38, the strongest correlation out of all the traits (matrix below).
agree consc extra invneuro openn
[1,] 1.00000000 0.22913633 0.2218264 0.36282889 0.09954667
[2,] 0.22913633 1.00000000 0.1553970 0.11042669 -0.09678278
[3,] 0.22182644 0.15539701 1.0000000 0.20712439 0.38345179
[4,] 0.36282889 0.11042669 0.2071244 1.00000000 0.01510956
[5,] 0.09954667 -0.09678278 0.3834518 0.01510956 1.00000000
When I test the below model I get decent fit metrics, however the estimates for extraversion are highly out of bounds due to the latent factor only having 2 indicators (F2~extra = 1.9, extra~extra = -2.7).
F1=~NA*agree+invneuro+consc
F2=~NA*openn+extra
F1~~1*F1
F2~~1*F2
chisq df p_chisq AIC CFI SRMR
df 32.50444 4 1.508672e-06 54.50444 0.935647 0.03967861

I then tested a model constraining the extraversion residual to be within bounds which produced similar fit indices, however now the latent factor becomes almost identical to extraversion when standardised (F2~extra = 1.0, F2~open = 0.36; LDSC correlation between openness and extraversion is 0.38).
F1=~NA*agree+invneuro+consc
F2=~NA*extra+openn
F1~~1*F1
F2~~1*F2
extra~~a*extra
a > 0.001
chisq df p_chisq AIC CFI SRMR
df 39.08002 4 6.706292e-08 61.08002 0.9208017 0.04307385

Out of curiosity I then tested the same model however I input openness before extraversion which produces standardised estimates which look much more sensible (F2~extra = 0.38, F2~open = 0.36) with almost identical fit indices.
F1=~NA*agree+invneuro+consc
F2=~NA*openn+extra
F1~~1*F1
F2~~1*F2
extra~~a*extra
a > 0.001
chisq df p_chisq AIC CFI SRMR
df 39.07998 4 6.706409e-08 61.07998 0.9208018 0.04307385

Is there any other ways of improving the quality of estimates for the openness/extraversion factor or are my hands tied by the data I have available?
If there is no work around, could I continue forward with the model based on prior theory with the big caveat that the openness/extraversion factor is likely empirically under-identified or should I just dump the factor altogether?
Apologies for the long post, I just wanted to be thorough. I'm very new to genomic SEM and would very much appreciate any insight.
Thanks heaps!
Liam