Hi guys,
I know many people have asked about this subject before, but I didn’t find something that gives me the answer I need.
I’m doing
Case-Control analysis using PLINK 1.9. During the pre-analysis QC
(which included “--maf 0.001” filter), I understood that PLINK
1.9 flips the A1/A2 order based on “minor”=A1 (and A1=ALT allele,
if I understand correctly).
In about 5% of my data, this is not the case, meaning the ALT allele has the bigger frequency.
I ran the pre-analysis QC (reminder: it included “--maf 0.001” filter) and the Case-Control analysis (using --assoc) twice: First I let PLINK work as usual, with the A1/A2 switching. In the second time I forced A2 as the “reference allele” (not ALT) by using “--a2-allele” flag (I checked, it worked).
Results:
The results of the QC (Number and identity of sample and SNPS) were exactly the same in the 2 analyses.
The results of the Case-Control analyses were also exactly the same.
How can it be? Am I wrong to understand that A1=ALT and A2=REF (and PLINK treats minor allele as the ALT allele)? Is there any effect of the A1/A2 switching on the –assoc function?
Thanks, Gabriel
plink2 \
--covar covarfile.txt \
--covar-name sex,age,PC1,PC2 \
--glm hide-covar cols=chrom,pos,alt1,ref,a1freq,beta,se,p \
--mach-r2-filter 0.3 1.0 \
--maf 0.01 \
--memory 64000 \
--out outfile \
--pfile genotypefileset \
--pheno phenofile.txt \
--pheno-name phenotype
#CHROM POS ID REF ALT1 A1 A1_FREQ BETA SE P
1 1 rs1 T C T 0.106321 0.0125737 0.0219298 0.56641
1 2 rs2 T C T 0.0177909 -0.00393126 0.0592781 0.947125
1 3 rs3 C T T 0.230796 -0.0195085 0.0212053 0.357598
1 4 rs4 C T T 0.201837 -0.0193924 0.0221503 0.381323