Hi everybody,
I conducted a GWAS (logistic regression) on two binary phenotypes, using the same genotype data. I use both dichotomous as well as continuous covariates in the model. I found that plink2 returned NA's for some SNPs for one phenotype, but valid test results for the other phenotype (and vice versa). The discrepancy leads to considerable differences in the number of valid test results per phenotype.
After trying out different things, I ran the same analyses in plink1. Oddly, plink1 did NOT produce NA's where plink 2 did! Attached you'll find the results for two phenotypes (A/B) and three SNPs conducted in both versions of plink (i.e. excerpts from the .assoc.logistic files).
Can anyone tell me what's going on here?
I know the implementation of logistic regression (with covariates) probably changed in plink2. However, I would like to know if, and how, this may have caused the issue.
Many thanks in advance,
Mats
P.S. Due to privacy concerns I cannot attach the raw data (and genomic positions in the attached results files are fictitious). However, if it helps I'm happy to run additional analyses to clarify this problem.