About Fst calculation in Plink.

2,166 views
Skip to first unread message

Bernabé Ignacio Bustos Becerra

unread,
Apr 9, 2015, 12:13:41 PM4/9/15
to plink2...@googlegroups.com
Dear Dr. Chang,

I'm working in a population genetics project using SNPs called from whole-genome sequences. One of my task is to calculate genetic differentiation with other populations (i.e 1000 genomes project) so I'm using Plink's Fst to do this. After doing pairwise calculations and getting Fst values for each SNP, I understand that in order to select variants with statistically significant Fst values I need to do a permutation process to create an empirical distribution and then use methods like Bonferroni-corrected permutation test P-value [1]. I found a helpful topic on Biostars (https://www.biostars.org/p/115453/)  where the user Zev.Kronenberg pointed out the necessary steps to follow. That's why I would like to know if this analysis or an alternative could be implemented in PLINK's Fst calculations?.

I hope you can hep me, I'll look forward to your comments.

Thanks in advance,

Best regards, 

Bernabé Bustos

Christopher Chang

unread,
Apr 10, 2015, 2:30:06 AM4/10/15
to plink2...@googlegroups.com
I will look into adding pFst to PLINK in the future.  For now, though, you are probably best off using the GPAT++ tool.

Bernabé Ignacio Bustos Becerra

unread,
Apr 10, 2015, 11:15:52 AM4/10/15
to plink2...@googlegroups.com
Thakns for your answer.
I have tried GPAT++ but pFST actually computes a chi-square statistic comparing allele frequency between 2 populations , it doesn't use heterozygosity so is not a real WC Fst estimator. I'll keep looking forward to se a way to boostrap plink Fst output to get a null distribution and then calculate Pvalues in order to detect significantly divergent SNPs. It would be nice to see this implemented in PLINK as standar procedure in evolutionary genetics.

Thanks again, have a great weekend.

Cheers,

Bernabé
Reply all
Reply to author
Forward
0 new messages