I have successfully converted all my autosomal chromosomes from bgen files to pgen files and then ran basic association test without any problem. Thanks!!
Now I am working on to do the same thing on UKB chrX file. The bgen to pgen process was OK, but the association test gave the following error message:
Actually I am using the same phenotype file with the other chromosomes, just deleted the samples not in chrX sample file. intBFP is just a continuous phenotype derived from body fat percentage, and it was running just fine in the other 22 chromosomes. Do you have any clue what might be the reason and how could I modify my script or the input file?
PLINK v2.00a1LM 64-bit Intel (11 Feb 2018)
Options in effect:
--covar ...../sampleX.sample
--covar-name sex
--glm cols=+a1count,+machr2
--memory 50000
--out BFP_chrX
--pfile .../chrX_good_pfile
--pheno ..../sampleX.sample
--pheno-name intBFP
--remove ....../BFPexcludeX
Hostname: sbcs2
Working directory: ..../GWAS_BFP
Start time: Thu Jul 26 18:50:27 2018
Random number seed: 1532649027
258218 MB RAM detected; reserving 50000 MB for main workspace.
Using up to 72 threads (change this with --threads).
486757 samples (263910 females, 222833 males, 14 ambiguous; 486757 founders)
loaded from ..../chrX_good_pfile.psam.
858805 variants loaded from
....../chrX_good_pfile.pvar.
1 quantitative phenotype loaded (288791 values).
--remove: 288791 samples remaining.
1 covariate loaded from ....../sampleX.sample.
288791 samples (157049 females, 131742 males; 288791 founders) remaining after
main filters.
288791 quantitative phenotype values remaining after main filters.
Warning: Skipping --glm regression on phenotype 'intBFP' since covariate
correlation matrix could not be inverted. You may want to remove redundant
covariates and try again.
End time: Thu Jul 26 18:50:44 2018
I have googled previous post and saw similar question. But I have tried to alter my --pheno file to contain only ID and intBFP, and my --covar file to contain only ID and sex. Still I have the same Warning and the process skip --glm. The weird thing is it only happens in chrX processing. When I process autosomal chromosomes, there is no issue on running linear regression, even I have more covariates in the file (maybe correlated, but no used in each regression).