FID IID PC1 PC2
FAM1 NWZ_106801_P03_WH01 -0.025861118 -0.07565873
FAM2 NWZ_106801_P01_WD11 -1.6138029 -0.08481272
FAM3 NWZ_106801_P03_WG01 -0.63852423 -0.121126294
FAM4 NWZ_106801_P02_WG03 -0.8286301 0.22903343
PCs were done in TASSEL. Anyway, I run the analysis:
PLINK v1.90b3 64-bit (11 Jan 2015)
9 arguments: --adjust --bfile myplinkPKsexphenoFID --covar cov_2pcs.txt --linear --out PKpcasperm --perm
Hostname: MacBook-Pro-de-Angela.local
Working directory: /Users/angelaparodymerino/Documents/genphen/PLINK_sd2/Allsites
Start time: Tue Nov 14 00:56:31 2017
Note: --perm flag deprecated. Use e.g. '--model perm'.
Random number seed: 1510617391
8192 MB RAM detected; reserving 4096 MB for main workspace.
3412 variants loaded from .bim file.
265 people (129 males, 136 females) loaded from .fam.
265 phenotype values loaded from .fam.
Using up to 8 threads (change this with --threads).
--covar: 2 covariates loaded.
Before main variant filters, 265 founders and 0 nonfounders present.
Calculating allele frequencies... done.
Total genotyping rate is 0.99727.
3412 variants and 265 people pass filters and QC.
Phenotype data is quantitative.
Writing linear model association results to PKpcasperm.assoc.linear ... done.
--adjust: Genomic inflation est. lambda (based on median chisq) = 1.11648.
--adjust values (3412 variants) written to PKpcasperm.assoc.linear.adjusted .
265303 (adaptive) permutations complete.
Permutation test report written to PKpcasperm.assoc.linear.perm .
End time: Tue Nov 14 00:56:34 2017
and obtained a quite much more inflated QQplot:
It might be because, according to what I have read (or I might be wrong?): covariate files only admit two values (1 or 2), and (0, -9 for missing values). Is it rounding all values of the PC1 and PC2 to 0, 1 or 2? (no idea).
Does anyone see what I might be doing wrong?
Or, should I use another way to correct the inflation due to population stratification?
Thanks in advance,
Ángela Parody-Merino