I am getting error while doing so as described above
plink2 --bfile Analysis --covar Covariate.txt --linear interaction dominant --no-sex --pheno TestPheno.txt --out Test --parameters 1-6,11 --no-pheno
Cov
FID IID Age Sex Ethnicity Infections_Status Combined
TAM501 TAM501 8.5 1 M V1 M+V1
258 samples (0 females, 0 males, 258 ambiguous; 258 founders) loaded from
Analysis.fam.
1642225 variants loaded from Analysis.bim.
3 quantitative phenotypes loaded.
5 covariates loaded from Covariate.txt.
Calculating allele frequencies... done.
Error: --parameters/--tests cannot currently be used directly with categorical
covariates; expand them into binary covariates with --split-cat-pheno first.
62 F1
68 F3
26 M1
102 M3
Step1 : run split-cat-pheno
plink2 --bfile Analysis --covar Covariate2.txt --no-sex --pheno TestPheno.txt --out Split --split-cat-pheno Combined --make-bed
Step2 : reheard the split covariate and manually change the header
awk '{print $1," ",$0}' Split.cov > New_Split.cov
Step3: Run association
Test 1: Running first 5 Covariates
/scratch/mv83/Software/plink2 --bfile Split --covar New_Split.cov --linear interaction dominant --no-sex --pheno TestPheno.txt --out Test --parameters 1-6 --no-pheno --threads 2
Output:
#CHROM POS ID REF ALT A1 TEST OBS_CT BETA SE T_STAT P
1 788538 h3a_37_1_723918_G_A G A A DOM 257 0.0254854 0.0897423 0.283984 0.776658
1 788538 h3a_37_1_723918_G_A G A A Age 257 -0.0172797 0.0174057 -0.99276 0.321787
1 788538 h3a_37_1_723918_G_A G A A Sex 257 0.11168 0.0769768 1.45083 0.148082
1 788538 h3a_37_1_723918_G_A G A A Combined=M1 257 0.0800373 0.140188 0.570927 0.568562
1 788538 h3a_37_1_723918_G_A G A A Combined=M3 257 0.0906461 0.0991741 0.91401 0.361592
1 788538 h3a_37_1_723918_G_A G A A Combined=F3 257 -0.0549291 0.106011 -0.518143 0.604817
Test 2: Running 6 Covariates (Error)
plink2 --bfile Split --covar New_Split.cov --linear interaction dominant --no-sex --pheno TestPheno.txt --out Test --parameters 1-7 --no-pheno --threads 2
PLINK v2.00a2LM 64-bit Intel (6 Oct 2019) www.cog-genomics.org/plink/2.0/
(C) 2005-2019 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to Test.log.
Options in effect:
--bfile Split
--covar New_Split.cov
--glm interaction dominant
--no-psam-pheno
--no-sex
--out Test
--parameters 1-7
--pheno TestPheno.txt
--threads 2
Start time: Mon May 9 13:55:55 2022
257154 MiB RAM detected; reserving 128577 MiB for main workspace.
Using up to 2 compute threads.
258 samples (0 females, 0 males, 258 ambiguous; 258 founders) loaded from
Split.fam.
1642225 variants loaded from Split.bim.
3 quantitative phenotypes loaded.
6 covariates loaded from New_Split.cov.
Calculating allele frequencies... done.
Warning: Skipping --glm regression on phenotype 'PPP1R9B', and other(s) with
identical missingness patterns, since covariate correlation matrix could not be
inverted (VIF_INFINITE). You may want to remove redundant covariates and try
again.
Test 3: Running5 Covariates with Interaction by skipping the issue Covariate
plink2 --bfile Split --covar New_Split.cov --linear interaction dominant --no-sex --pheno TestPheno.txt --out Test --parameters 1-6,10-12 --no-pheno --threads 2
1 788538 h3a_37_1_723918_G_A G A A DOM 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A Age 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A Sex 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A Combined=M1 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A Combined=M3 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A Combined=F3 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A DOMxCombined=M1 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A DOMxCombined=M3 257 NA NA NA NA
1 788538 h3a_37_1_723918_G_A G A A DOMxCombined=F3 257 NA NA NA NA
I have tried using the recent build (PLINK v2.00a3LM 64-bit Intel (3 May 2022) ) and also tried --max-corr and --vif 999999 but has the same issue
Issues :
1) what's the issue by including the F1 category and how to fix the issue (Test2).
Run on the current Version of Plink
./plink2 --bfile Split --covar New_Split.cov --linear interaction dominant --no-sex --pheno TestPheno.txt --out Test --parameters 1-7 --no-pheno --threads 2 --warning-errcode
PLINK v2.00a3LM 64-bit Intel (3 May 2022) www.cog-genomics.org/plink/2.0/
(C) 2005-2022 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to Test.log.
Options in effect:
--bfile Split
--covar New_Split.cov
--glm interaction dominant
--no-psam-pheno
--no-sex
--out Test
--parameters 1-7
--pheno TestPheno.txt
--threads 2
--warning-errcode
Start time: Tue May 10 11:02:29 2022
257154 MiB RAM detected; reserving 128577 MiB for main workspace.
Using up to 2 compute threads.
258 samples (0 females, 0 males, 258 ambiguous; 258 founders) loaded from
Split.fam.
1642225 variants loaded from Split.bim.
3 quantitative phenotypes loaded.
6 covariates loaded from New_Split.cov.
Calculating allele frequencies... done.
Error: Cannot proceed with --glm regression on phenotype 'PPP1R9B', and
other(s) with identical missingness patterns, since covariate correlation
matrix could not be inverted (VIF_INFINITE). You may want to remove redundant
covariates and try again.
End time: Tue May 10 11:02:29 2022
2) I am little bit confused and did not understand by removing the covariate.FID IID Age Sex Combined
SAM11 SAM11 8.5 1 M1
SAM12 SAM12 5.9 1 M3
SAM13 SAM13 8.5 2 M1
SAM15 SAM15 7.1 1 M3
SAM17 SAM17 10.5 2 M1
Step1 : run split-cat-pheno
plink2 --bfile Analysis --covar Covariate2.txt --no-sex --pheno TestPheno.txt --out Split --split-cat-pheno Combined --make-bed
#IID Age Sex Combined=F1 Combined=F3 Combined=M1 Combined=M3
SAM19 12.1 1 1 1 1 2
SAM117 10.1 1 1 2 1 1
SAM133 11.1 2 1 1 1 2
SAM197 8.5 1 1 1 1 2