Since there are 677543 variants in my genotype data, each eigenvector should be of dimension 677543. Let's say if I requested 20 eigenvectors, then the matrix of eigenvectors should be of dimension 677543 * 20. I am not sure in the case of more columns than rows (i.e., more variants than individuals), if the dimension of eigenvector become truncated by the number of rows, which is 3222 for my data. Correct me if I am wrong.
While principal component scores, which is the projected value of each individual onto each eigenvector, should be of dimension 3222*20.
Given that the
.eigenvec file is of dimension 3222*20 (not including the first two columns of individual identifiers), I want to make sure if the numbers in eigenvec file mean eigenvector truncaed in dimension by sample size OR principal component scores along the eigenvectors?
In addition, I want to ask when correcting for PCs in GWAS, should I include
eigenvector truncaed in dimension by sample size OR principal component scores along the eigenvectors as covariates?
Xiaolv