Hi,
Thanks for your quick response. I understand --maf and --glm should definitely consider dosage info. My major question is the criteria for --geno. It seems there are a lot of missing data for my dosage data. I am thinking what is the best way to save more data in relatively high-quality criteria. BTW, I have excluded the SNPs with Rsq (<0.3) in miniMac4 imputation data. I am also interested in the criteria for --hwe with dosage data. Here is the full log.
Thanks again!
(C) 2005-2020 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink190_plink190losgistic.log.
Options in effect:
--allow-no-sex
--bfile impute030_n3_all_chr22_plink190
--geno 0.1
--hwe 1.0e-20 midp
--logistic
--maf 0.01
--out impute030_n3_all_chr22_plink190_plink190losgistic
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
257652 MB RAM detected; reserving 128826 MB for main workspace.
100322 variants loaded from .bim file.
22848 people (0 males, 0 females, 22848 ambiguous) loaded from .fam.
Ambiguous sex IDs written to
impute030_n3_all_chr22_plink190_plink190losgistic.nosex .
8446 phenotype values present after --pheno.
Using 1 thread (no multithreaded calculations invoked).
Before main variant filters, 22848 founders and 0 nonfounders present.
Calculating allele frequencies... 0% 1% 2% 3% 4% 5% 6% 7% 8% 9% 10% 11% 12% 13% 14% 15% 16% 17% 18% 19% 20% 21% 22% 23% 24% 25% 26% 27% 28% 29% 30% 31% 32% 33% 34% 35% 36% 37% 38% 39% 40% 41% 42% 43% 44% 45% 46% 47% 48% 49% 50% 51% 52% 53% 54% 55% 56% 57% 58% 59% 60% 61% 62% 63% 64% 65% 66% 67% 68% 69% 70% 71% 72% 73% 74% 75% 76% 77% 78% 79% 80% 81% 82% 83% 84% 85% 86% 87% 88% 89% 90% 91% 92% 93% 94% 95% 96% 97% 98% 99% done.
0 variants removed due to missing genotype data (--geno).
--hwe: 0 variants removed due to Hardy-Weinberg exact test.
44324 variants removed due to minor allele threshold(s)
(--maf/--max-maf/--mac/--max-mac).
55998 variants and 22848 people pass filters and QC.
Among remaining phenotypes, 4487 are cases and 3959 are controls. (14402
phenotypes are missing.)
Writing logistic model association results to
impute030_n3_all_chr22_plink190_plink190losgistic.assoc.logistic ... 0% 1% 2% 3% 4% 5% 6% 7% 8% 9% 10% 11% 12% 13% 14% 15% 16% 17% 18% 19% 20% 21% 22% 23% 24% 25% 26% 27% 28% 29% 30% 31% 32% 33% 34% 35% 36% 37% 38% 39% 40% 41% 42% 43% 44% 45% 46% 47% 48% 49% 50% 51% 52% 53% 54% 55% 56% 57% 58% 59% 60% 61% 62% 63% 64% 65% 66% 67% 68% 69% 70% 71% 72% 73% 74% 75% 76% 77% 78% 79% 80% 81% 82% 83% 84% 85% 86% 87% 88% 89% 90% 91% 92% 93% 94% 95% 96% 97% 98% 99% done.
(C) 2005-2021 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink190_plink2glm.log.
Options in effect:
--allow-no-sex
--bfile impute030_n3_all_chr22_plink190
--geno 0.1
--glm allow-no-covars
--hwe 1.0e-20 midp
--maf 0.01
--out impute030_n3_all_chr22_plink190_plink2glm
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
Start time: Wed Mar 24 18:07:20 2021
Note: --allow-no-sex no longer has any effect. (Missing-sex samples are
automatically excluded from association analysis when sex is a covariate, and
treated normally otherwise.)
257652 MiB RAM detected; reserving 128826 MiB for main workspace.
Using up to 56 threads (change this with --threads).
22848 samples (0 females, 0 males, 22848 ambiguous; 22848 founders) loaded from
impute030_n3_all_chr22_plink190.fam.
100322 variants loaded from impute030_n3_all_chr22_plink190.bim.
1 binary phenotype loaded (4487 cases, 3959 controls).
Calculating allele frequencies... 0% 65% done.
--geno: 0 variants removed due to missing genotype data.
--hwe midp: 14120 variants removed due to Hardy-Weinberg exact test (founders
only).
44320 variants removed due to allele frequency threshold(s)
(--maf/--max-maf/--mac/--max-mac).
41882 variants remaining after main filters.
--glm logistic-Firth hybrid regression on phenotype 'lt12_lifeaud5': 0% 68% done.
Results written to impute030_n3_all_chr22_plink190_plink2glm.lt12_lifeaud5.glm.logistic.hybrid .
End time: Wed Mar 24 18:07:23 2021
(C) 2005-2021 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink2_glm.log.
Options in effect:
--allow-no-sex
--geno 0.1
--glm allow-no-covars
--hwe 1.0e-20 midp
--maf 0.01
--out impute030_n3_all_chr22_plink2_glm
--pfile impute030_n3_all_chr22_plink2
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
Start time: Wed Mar 24 18:07:23 2021
Note: --allow-no-sex no longer has any effect. (Missing-sex samples are
automatically excluded from association analysis when sex is a covariate, and
treated normally otherwise.)
257652 MiB RAM detected; reserving 128826 MiB for main workspace.
Using up to 56 threads (change this with --threads).
22848 samples (0 females, 0 males, 22848 ambiguous; 22848 founders) loaded from
impute030_n3_all_chr22_plink2.psam.
100322 variants loaded from impute030_n3_all_chr22_plink2.pvar.
1 binary phenotype loaded (4487 cases, 3959 controls).
Calculating allele frequencies... 0% 65% done.
--geno: 0 variants removed due to missing genotype data.
--hwe midp: 14120 variants removed due to Hardy-Weinberg exact test (founders
only).
44320 variants removed due to allele frequency threshold(s)
(--maf/--max-maf/--mac/--max-mac).
41882 variants remaining after main filters.
--glm logistic-Firth hybrid regression on phenotype 'lt12_lifeaud5': 0% 68% done.
Results written to impute030_n3_all_chr22_plink2_glm.lt12_lifeaud5.glm.logistic.hybrid .
End time: Wed Mar 24 18:07:26 2021
(C) 2005-2021 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink2dosage_glm.log.
Options in effect:
--allow-no-sex
--geno 0.1
--glm allow-no-covars
--hwe 1.0e-20 midp
--maf 0.01
--out impute030_n3_all_chr22_plink2dosage_glm
--pfile impute030_n3_all_chr22_plink2dosage
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
Start time: Wed Mar 24 18:07:26 2021
Note: --allow-no-sex no longer has any effect. (Missing-sex samples are
automatically excluded from association analysis when sex is a covariate, and
treated normally otherwise.)
257652 MiB RAM detected; reserving 128826 MiB for main workspace.
Using up to 56 threads (change this with --threads).
22848 samples (0 females, 0 males, 22848 ambiguous; 22848 founders) loaded from
impute030_n3_all_chr22_plink2dosage.psam.
100322 variants loaded from impute030_n3_all_chr22_plink2dosage.pvar.
1 binary phenotype loaded (4487 cases, 3959 controls).
Calculating allele frequencies... 0% 65% done.
--geno: 33579 variants removed due to missing genotype data.
--hwe midp: 3710 variants removed due to Hardy-Weinberg exact test (founders
only).
40955 variants removed due to allele frequency threshold(s)
(--maf/--max-maf/--mac/--max-mac).
22078 variants remaining after main filters.
--glm logistic-Firth hybrid regression on phenotype 'lt12_lifeaud5': 0% 67% done.
Results written to impute030_n3_all_chr22_plink2dosage_glm.lt12_lifeaud5.glm.logistic.hybrid .
End time: Wed Mar 24 18:07:32 2021
(C) 2005-2021 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink2_glm_requirePheno.log.
Options in effect:
--allow-no-sex
--geno 0.1
--glm allow-no-covars
--hwe 1.0e-20 midp
--maf 0.01
--out impute030_n3_all_chr22_plink2_glm_requirePheno
--pfile impute030_n3_all_chr22_plink2
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
--require-pheno
Start time: Wed Mar 24 18:07:32 2021
Note: --allow-no-sex no longer has any effect. (Missing-sex samples are
automatically excluded from association analysis when sex is a covariate, and
treated normally otherwise.)
257652 MiB RAM detected; reserving 128826 MiB for main workspace.
Using up to 56 threads (change this with --threads).
22848 samples (0 females, 0 males, 22848 ambiguous; 22848 founders) loaded from
impute030_n3_all_chr22_plink2.psam.
100322 variants loaded from impute030_n3_all_chr22_plink2.pvar.
1 binary phenotype loaded (4487 cases, 3959 controls).
--require-pheno: 14402 samples removed.
8446 samples (0 females, 0 males, 8446 ambiguous; 8446 founders) remaining
after main filters.
4487 cases and 3959 controls remaining after main filters.
Calculating allele frequencies... 0% 65% done.
--geno: 0 variants removed due to missing genotype data.
--hwe midp: 0 variants removed due to Hardy-Weinberg exact test (founders
only).
55695 variants removed due to allele frequency threshold(s)
(--maf/--max-maf/--mac/--max-mac).
44627 variants remaining after main filters.
--glm logistic-Firth hybrid regression on phenotype 'lt12_lifeaud5': 0% 67% done.
Results written to impute030_n3_all_chr22_plink2_glm_requirePheno.lt12_lifeaud5.glm.logistic.hybrid .
End time: Wed Mar 24 18:07:34 2021
(C) 2005-2021 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to impute030_n3_all_chr22_plink2dosage_glm__requirePheno.log.
Options in effect:
--allow-no-sex
--geno 0.1
--glm allow-no-covars
--hwe 1.0e-20 midp
--maf 0.01
--out impute030_n3_all_chr22_plink2dosage_glm__requirePheno
--pfile impute030_n3_all_chr22_plink2dosage
--pheno temp_n3_cau_phen_lt12_lifeaud5
--pheno-name lt12_lifeaud5
--require-pheno
Start time: Wed Mar 24 18:07:34 2021
Note: --allow-no-sex no longer has any effect. (Missing-sex samples are
automatically excluded from association analysis when sex is a covariate, and
treated normally otherwise.)
257652 MiB RAM detected; reserving 128826 MiB for main workspace.
Using up to 56 threads (change this with --threads).
22848 samples (0 females, 0 males, 22848 ambiguous; 22848 founders) loaded from
impute030_n3_all_chr22_plink2dosage.psam.
100322 variants loaded from impute030_n3_all_chr22_plink2dosage.pvar.
1 binary phenotype loaded (4487 cases, 3959 controls).
--require-pheno: 14402 samples removed.
8446 samples (0 females, 0 males, 8446 ambiguous; 8446 founders) remaining
after main filters.
4487 cases and 3959 controls remaining after main filters.
Calculating allele frequencies... 0% 65% done.
--geno: 30951 variants removed due to missing genotype data.
--hwe midp: 594 variants removed due to Hardy-Weinberg exact test (founders
only).
52549 variants removed due to allele frequency threshold(s)
(--maf/--max-maf/--mac/--max-mac).
16228 variants remaining after main filters.
--glm logistic-Firth hybrid regression on phenotype 'lt12_lifeaud5': 0% 65% done.
Results written to impute030_n3_all_chr22_plink2dosage_glm__requirePheno.lt12_lifeaud5.glm.logistic.hybrid .
End time: Wed Mar 24 18:07:40 2021