Comparing of heritability estimates of ldsc and GCTA

442 views
Skip to first unread message

Ganesh Chauhan

unread,
Apr 13, 2015, 11:10:21 AM4/13/15
to ldsc_...@googlegroups.com
Hi,

LDSC is a great tool and is very useful in this era of GWAS where we already have multiple published GWAS of large scale.

I was interested in doing some "heritability estimates" and "genetic correlation" of two quantitative phenotypes based on their summary statistics. For comparison shake I used a study which has imputed data and GWAS results for two quantitative trait.
I used the high quality markers that we get after using "munge_sumstats.py" to perform genetic correlation using "ldsc.py". The syntax used are mentioned below.

python munge_sumstats.py --sumstats file1.input.gz --N 1397 --out file1 --merge-alleles w_hm3.snplist

ldsc.py \
--rg file1.sumstats.gz,file1.sumstats.gz \
--ref-ld-chr eur_w_ld_chr/ \
--w-ld-chr eur_w_ld_chr/ \
--out checking

Then I used the same set of markers that were used for estimating the correlation and estimated heritablity using GCTA. This was just to make sure we are on the same page. However, I find that the heritablity estimates for the two phenotypes are much higher when using ldsc than GCTA. The results of GCTA using whole genome data and the selective list of SNPs are almost comparable. So I do not think the SNP list makes such a huge difference.

GCTA

LDSC

phenotype1

0.42 (0.23)

0.759 (0.3352)

phenotype2

0.33 (0.23)

0.681 (0.3361)


Any comments as to why we differ so much in terms of heritability estimates between the two software. 


With best regards,
Ganesh Chauhan
INSERM, FRANCE

Brendan

unread,
Apr 14, 2015, 8:28:33 AM4/14/15
to ldsc_...@googlegroups.com
Hi Ganesh,

The difference between those two estimates is about 1 SE, so not statistically significant (actually none of those h2 estimates is significantly different from zero). It looks like the issue is that your sample size (1397) is not large, so the estimates from both GCTA and LD Score are very noisy. At higher sample size (i.e., less noise) and in simulations, we have observed that GCTA and LD Score tend to give very similar results.

Ganesh Chauhan

unread,
Apr 16, 2015, 12:01:41 PM4/16/15
to ldsc_...@googlegroups.com
Thanks a lot Brendan.
Reply all
Reply to author
Forward
0 new messages