faststructure and vcf tools

680 views
Skip to first unread message

Karine Durand

unread,
Jan 12, 2016, 3:34:52 PM1/12/16
to structure-software, karine...@angers.inra.fr
Hello
i try to make an imput file for faststructure. I have a vcf file but my SNP are in row and i have 272 000 SNPs. So how can i make a vcf file with my samples in row and SNPs in colunns? if i try to do it manually it's impossible because excel can't open 272000 colunns and i can't use vcftools --vcf mydata --plink to create a .fam, .bim and .bed files for faststructure.
Can you help me?
thanks

Vikram Chhatre

unread,
Jan 12, 2016, 3:37:03 PM1/12/16
to structure-software
Hi Karine,

You can use PGDspider to convert from VCF to fastStructure.  You will need a lot of memory for doing this.  So if you have access to a cluster, do this there.  Use as much memory as you have access to.

V


--
You received this message because you are subscribed to the Google Groups "structure-software" group.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-softw...@googlegroups.com.
To post to this group, send email to structure...@googlegroups.com.
Visit this group at https://groups.google.com/group/structure-software.
For more options, visit https://groups.google.com/d/optout.

Karine Durand

unread,
Jan 13, 2016, 3:22:06 AM1/13/16
to structure-software, karine...@angers.inra.fr

Thanks a lot it's nice, but i have another question How can you create the .bam and .bim files with your PGDspider file?

Vikram Chhatre

unread,
Jan 25, 2016, 9:20:47 PM1/25/16
to structure-software
You can't with PGD, but you don't need to.  FastStructure can use either of the two formats:

- Structure
- Plink Bed

With PGD, you can convert VCF to Structure (make sure you use the fastStr option in the settings file).

V

--
Reply all
Reply to author
Forward
0 new messages