Hello, and sorry for posting a question on a Friday afternoon.
While running munge_sumstats.py, I got an error about the median value of beta.
The full output is below:
Interpreting column names as follows:
a1: Allele 1, interpreted as ref allele for signed sumstat.
pval: p-Value
beta: [linear/logistic] regression coefficient (0 --> no effect; above 0 --> A1 is trait/risk increasing)
snpid: Variant ID (e.g., rs number)
a2: Allele 2, interpreted as non-ref allele for signed sumstat.
Reading list of SNPs for allele merge from YY.assoc1.snp.snplist
Read 25361 SNPs for allele merge.
Reading sumstats from YY.assoc1.ld.txt into memory 5000000 SNPs at a time.
. done
Read 25361 SNPs from --sumstats file.
Removed 0 SNPs not in --merge-alleles.
Removed 0 SNPs with missing values.
Removed 0 SNPs with INFO <= 0.9.
Removed 0 SNPs with MAF <= 0.01.
Removed 0 SNPs with out-of-bounds p-values.
Removed 247 variants that were not SNPs or were strand-ambiguous.
25114 SNPs remain.
Removed 0 SNPs with duplicated rs numbers (25114 SNPs remain).
Using N = 25361.0
ERROR converting summary statistics:
Traceback (most recent call last):
File "C:/Users/inha/ldsc/munge_sumstats.py", line 701, in munge_sumstats
check_median(dat.SIGNED_SUMSTAT, signed_sumstat_null, 0.1, sign_cname))
File "C:/Users/inha/ldsc/munge_sumstats.py", line 373, in check_median
raise ValueError(msg.format(F=name, M=expected_median, V=round(m, 2)))
ValueError: WARNING: median value of beta is 0.27 (should be close to 0). This column may be mislabeled.
Conversion finished at Fri Jun 29 15:08:42 2018
Total time elapsed: 0.24s
Traceback (most recent call last):
File "C:/Users/inha/ldsc/munge_sumstats.py", line 746, in <module>
munge_sumstats(parser.parse_args(), p=True)
File "C:/Users/inha/ldsc/munge_sumstats.py", line 701, in munge_sumstats
check_median(dat.SIGNED_SUMSTAT, signed_sumstat_null, 0.1, sign_cname))
File "C:/Users/inha/ldsc/munge_sumstats.py", line 373, in check_median
raise ValueError(msg.format(F=name, M=expected_median, V=round(m, 2)))
ValueError: WARNING: median value of beta is 0.27 (should be close to 0). This column may be mislabeled.
I don't understand why ldsc computes the median of beta and raises an error like this.
Could someone explain why this error occurs and how I can fix it?
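
For reference, judging only from the traceback, the check seems to work roughly like the sketch below. This is my guess, not the actual ldsc code: the function name `check_median_sketch` is hypothetical, and I am assuming that the third argument (0.1 in the call) is a tolerance around the expected median (0 for beta).

```python
from statistics import median

# Hypothetical re-creation of the check, inferred from the traceback only.
# Assumption: the check fails when the median is further than `tolerance`
# from `expected_median`.
def check_median_sketch(values, expected_median, tolerance, name):
    m = median(values)
    if abs(m - expected_median) > tolerance:
        msg = ("WARNING: median value of {F} is {V} (should be close to {M}). "
               "This column may be mislabeled.")
        raise ValueError(msg.format(F=name, M=expected_median, V=round(m, 2)))
    return m

# Betas centered near zero pass the check.
check_median_sketch([-0.10, 0.00, 0.05], 0, 0.1, "beta")

# Betas centered near 0.27 (as in my data) fail it.
try:
    check_median_sketch([0.20, 0.27, 0.30], 0, 0.1, "beta")
except ValueError as err:
    print(err)
```

If that reading is right, the error would mean that my beta column is not centered on 0, for example because it actually holds odds ratios (centered on 1) or because the file contains only a selected subset of SNPs.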