Hi Anil,
Depends a bit on the reason for exclusion.
a) The MHC is normally excluded from computing weights. The unusual LD in the MHC is liable to produce outlier LD scores that may substantially influence the regression, plus it’s not unreasonable to hypothesize that the assumptions about polygenicity and neutral genetic drift underlying ldsc might be a poor representation of effects in the region. The precomputed LD score weights provided for download should already reflect this exclusion.
b) For exclusion of loci with an extreme effect sizes (e.g. APoE), exclusion should happen in the summary statistics since necessary exclusions (if any) are going to be different for each phenotype. There’s not a specific recommended method for defining the exclusion. If you have a region for the locus defined by e.g. LD clumping that’s probably sufficient, or you could drop some region (e.g. 1 cM) around the lead SNP.
Also, if ldsc detects highly significant SNPs it suspects are outliers it will by default drop those top SNPs by imposing a threshold for maximum chi-square statistic. Should appear in the log file with the message:
"Removed X SNPs with chi^2 > X (X SNPs remain)”
Cheers,
Raymond