Hi,
I'd like to test genotypes against a quantitative trait for a large haploid genome, whereby all genotypes are either '0' or '1' .
The simplest statistical test for this, i.e. comparing the average trait value for '0' vs '1' - is a t-test. (or other two-sample test).
I can do this in R but it takes a long time with a million SNPs. So I was thinking that modification of PLINK code would be effective, considering this is such a simple test.
I don't have any experience in C/C++ but would happily give it a try if someone could point me in the right direction.
Sincerely,
William Gilks