Dear all,
I have several questions about calculating the effective sample size (Neff).
2. Here are several other equations can be used to estimate the Neff. First, according to
wiki, Neff=4*v*(1-v). Second, according to Zhu et. al. (
https://doi.org/10.1038/ng.3538),
SE = 1/sqrt((Neff+Z^2)*2*MAF*(1-MAF)). I am wondering which one should be used in the Genomic SEM.
3. I tried to calculate the Neff of GWASs used in Grotzinger et. al. (
https://doi.org/10.1101/2021.09.22.21263909). However, I can not get the correct results. As shown in the following picture, the Neff of AN GWAS is 34,467, but the Neff in the summary statistics is 46321.9.
Also, the Neff of OCD GWAS is 5712, but the summed Neff of 7 cohorts (displayed in the following picture) is Neff=sum(4*v_k*(1-v_k)*n_k)=7281.
I also tried other equations above, but no one is correct. I am wondering how does the Neffs in above picture are calculated.
Acturally, the Neff in the summary statistics of
MDD GWAS is 69115, but the Neff of cohorts (displayed in the picture) is 137,301.
It seems that different GWAS use different methods to calculate Neff. I am wondering which equation should be used in Genomic SEM. Would you please share the code of the preprint (
https://doi.org/10.1101/2021.09.22.21263909) to me.
4. By the way, I used the latest anxiety GWAS (10.1001/jamapsychiatry.2019.1119) and several GWASs (available in PGC website) to obtain the genetic correlation matirx. The GCs are beyond 1 (range: 1.2~3.7). I assume the genetic correlation is close to [-1,1]. What happened here?
Thank you very much.
Best,
Jujiao