You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to flashpca-users
Hello,
I was just curious how flashpca handles missing genotypes. Does a sample with a missing genotype at a certain site contribute to the variance? Or is it ignored.
Thank you,
Nicole
Gad Abraham
unread,
Dec 10, 2018, 8:46:34 PM12/10/18
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Nicole Torosin, flashpca-users
Hi Nicole,
Missing genotypes are imputed to the average for that SNP, which after
standardisation becomes zero. The average and sd are based on the
non-missing genotypes for that SNP, so a sample with a missing
phenotype does not contribute.
If you have SNPs with high missingness, you should probably exclude
them beforehand using plink.
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to flashpca-users
Thank you so much.
On a related note, if I am evaluating X chromosome data using males and females, how does the PCA handle a combination of diploid and haploid genotypes?
Would you recommend using only females for this type of analysis?
Thank you again,
Nicole
Gad Abraham
unread,
Jan 2, 2019, 7:39:49 AM1/2/19
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Nicole Torosin, flashpca-users
Hi Nicole,
It really depends on what you're trying to do; if you want to detect
population structure or other large-scale variation like that, then
you don't need chr X, and the autosomal chromosomes will suffice
(assuming it's human). If you really want to do an analysis of chr X,
simply putting chr X genotypes in the PCA will probably "work" (not
crash) but may give garbage results, but I don't have an answer off
the top of my head how to do it in a sensible way that accounts for
sex.