How to find the issue leads to "CORR_TOO

Dandan Peng

unread,

Oct 18, 2025, 5:55:59 PM (9 days ago) Oct 18

to plink2-users

Hi,

I'm running PLINK2 with simulated data. I want to include PC1 as a covariate to decrease the confounding effect brought by population structure. For some SNPs, I got NA for 'BETA', 'SE', 'T_STAT' and 'P'. The ERRCODE column shows 'CORR_TOO_HIGH'. I checked the correlation coefficient beteween PC1 and the genotype for those SNPs with Python. None of them show a correlation coefficient higher than 0.999 (the range is between -0.6~0.6). Do you have any thoughts on the potential reason for the error code? Thank you for the help!

Here's my command:

plink2 --pfile offspring_1child --pheno offspring_1child.pheno --pheno-name PHENO --glm --covar offspring_1child.covar --covar-name PC1 --out offspring_1child_gwas

Here's how the input data look like:

- Phenotype file

FID IID PHENO
FAM0 std1 -106.53436402346294
FAM1 std2 -70.19387845241816
FAM2 std3 -58.76111005636025
FAM3 std4 -113.04966863837323
FAM4 std5 -97.73902176007009
FAM5 std6 -79.56021192968942

- Covar file

FID IID PC1
FAM0 std1 3.797275709942582
FAM1 std2 4.2747284116346425
FAM2 std3 4.085876098266804
FAM3 std4 3.7928090917961015
FAM4 std5 4.208008149837333
FAM5 std6 3.8446688074863635

This is the log file:

PLINK v2.00a4.3 AVX2 (10 Jun 2023)
Options in effect:
--covar offspring_1child.covar
--covar-name PC1
--glm
--out offspring_1child_gwas
--pfile offspring_1child
--pheno offspring_1child.pheno
--pheno-name PHENO

Hostname: endeavour1.hpc.usc.edu
Working directory: litmus_test
Start time: Sat Oct 18 14:46:59 2025

Random number seed: 1760824019
191861 MiB RAM detected, ~138162 available; reserving 95930 MiB for main
workspace.
Using up to 64 threads (change this with --threads).
2500 samples (0 females, 0 males, 2500 ambiguous; 0 founders) loaded from
offspring_1child.psam.
5000 variants loaded from offspring_1child.pvar.
1 quantitative phenotype loaded (2500 values).
1 covariate loaded from offspring_1child.covar.
--glm linear regression on phenotype 'PHENO': done.
Results written to offspring_1child_gwas.PHENO.glm.linear .

End time: Sat Oct 18 14:46:59 2025

Chris Chang

unread,

Oct 18, 2025, 6:07:24 PM (9 days ago) Oct 18

to Dandan Peng, plink2-users

- Do you get the same result if you run with a current PLINK2 build?

- If yes, can you post a dataset and command that allows someone else to replicate what you’re seeing?

--
You received this message because you are subscribed to the Google Groups "plink2-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to plink2-users...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/plink2-users/73fe6f6e-3a10-4426-ba1a-9fd694288a01n%40googlegroups.com.

Dandan Peng

unread,

Oct 18, 2025, 10:27:54 PM (9 days ago) Oct 18

to plink2-users

I replied twice but can't see my message record. So I'm trying with google drive: the link to the dataset is here.

For the first point -- yes I tried alpha-6.5 version and still got the same results.

Chris Chang

unread,

Oct 18, 2025, 10:30:19 PM (9 days ago) Oct 18

to Dandan Peng, plink2-users

I saw the data files the first time around and can reproduce the issue; will post a bugfix tonight or tomorrow.

To view this discussion visit https://groups.google.com/d/msgid/plink2-users/e7b6ab82-56bc-4964-b524-999d8b8a3efcn%40googlegroups.com.

Christopher Chang

unread,

Oct 19, 2025, 2:35:07 PM (8 days ago) Oct 19

to plink2-users

Bugfix is now posted; let me know if you run into any more problems.

Reply all

Reply to author

Forward

How to find the issue leads to "CORR_TOO_HIGH" errorcode?

Dandan Peng

Chris Chang

Dandan Peng

Chris Chang

Christopher Chang