I have VCF file for multiple sample of a population. How can I use this file in structure software ? What would be the best workflow from NGS raw reads to structure ? I have used Stacks but it has few concerns in how it writes structure file from SNP data.
--
You received this message because you are subscribed to the Google Groups "structure-software" group.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-softw...@googlegroups.com.
To post to this group, send email to structure...@googlegroups.com.
Visit this group at http://groups.google.com/group/structure-software.
For more options, visit https://groups.google.com/d/optout.
Great question! There are two ways to do this:1. Use vcftools to export to .ped format with the --plink switch. Then convert the .ped to .bed using Plink. Faststructure will read the .bed format files (there are three files for each project).2. Use pgdSpider to convert from vcf to structure or faststructure. Make sure to run --maf filter in vcftools prior to converting the files, to get rid of any monomorphic loci. The --min-alleles and --max-alleles, even when both set at a value of 2 do not get rid of monomorphic sites. The --maf filter (keep it very low) will accomplish this. If monomorphic sites are retained, pgdSpider is unable to properly convert the data.PgdSpider needs a popfile containing sample id and pop id columns, and an SPID file which is a simple text parameter file which you can generate from the gui.I prefer # 2. Let me know if you have more questions.V
On Sat, Feb 7, 2015 at 4:45 AM, geek_y <goutha...@gmail.com> wrote:
I have VCF file for multiple sample of a population. How can I use this file in structure software ? What would be the best workflow from NGS raw reads to structure ? I have used Stacks but it has few concerns in how it writes structure file from SNP data.
--
You received this message because you are subscribed to the Google Groups "structure-software" group.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-software+unsub...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-softw...@googlegroups.com.
I have tried using PGDSpider to convert a vcf file but get the warning message that bases have been converted to integers. So my file contains -9 for missing data and only ints 1-4 for each genotype. Can STRUCTURE accept this format or do you know why this error may be occurring?
Thanks
--
You received this message because you are subscribed to the Google Groups "structure-software" group.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-softw...@googlegroups.com.
To post to this group, send email to structure...@googlegroups.com.
Visit this group at https://groups.google.com/group/structure-software.