Help with SNP calling


Barney Wharam

Aug 12, 2013, 11:15:02 AM
to ngsshef
Dear All,

Following the NGS workshop in Sheffield that I attended, I've been trying to use Victor's RAD mapping practical to help me with my own SNP calling. I first wanted to ask whether I am approaching this in the correct manner.

I have 20 strains of C. elegans, all of which have been sequenced by GenePool in Edinburgh. The files I was given are named in a <strain>_3_run_elemt_sorted_realigned.sorted.bam format, which I presume corresponds roughly to the sorted.bam files on slide 54 of Victor's practical RAD_mapping.ppt.

I have produced a .vcf with a long list of SNPs for each strain.

What can I then do to identify non-synonymous SNPs in specific loci, bearing in mind that C. elegans has a well-annotated genome?

Genome viewers such as IGV (http://www.broadinstitute.org/igv/) can open sorted.bam files and show SNP calls very clearly, including Phred scores and other useful information, but it is quite painstaking to do this SNP by SNP in what is essentially a graphical format.

I thought that the best way to do this would be to look at my .vcf files in IGV or on Galaxy, but my .vcf files are around 10GB each, so they are way over the limit for Galaxy upload or for viewing in IGV.

Is it alarming that my .vcf is so large? If so, is this due to insufficient filtering, bearing in mind that this is whole-genome variant calling?
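
One way to check where the size comes from is to stream through the file and count the record types rather than opening it in a viewer. The following is only a minimal sketch, assuming a standard tab-delimited VCF in which non-variant sites have "." in the ALT column; the function names and the QUAL threshold of 30 are illustrative assumptions, not part of any existing pipeline.

```python
import gzip
from collections import Counter


def open_vcf(path):
    """Open a plain-text or gzip-compressed VCF for line-by-line reading."""
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path, "r")


def summarise_vcf(lines, min_qual=30.0):
    """Count record types in an iterable of VCF lines.

    Lines are processed one at a time, so a 10GB file never has to fit
    in memory. min_qual is an arbitrary illustrative threshold.
    """
    counts = Counter()
    for line in lines:
        if line.startswith("#"):
            continue  # skip meta-information and the column header
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # skip malformed or empty lines
        ref, alt, qual = fields[3], fields[4], fields[5]
        counts["records"] += 1
        if alt == ".":
            # Assumption: "." in ALT marks a site with no variant called
            counts["non_variant"] += 1
            continue
        counts["variant"] += 1
        if len(ref) == 1 and all(len(a) == 1 for a in alt.split(",")):
            counts["snp"] += 1
        else:
            counts["indel_or_other"] += 1
        if qual != "." and float(qual) >= min_qual:
            counts["variant_pass_qual"] += 1
    return counts
```

For a real file this would be called as `summarise_vcf(open_vcf("strain.vcf"))`, where the file name is hypothetical. If most records turn out to be non-variant sites, the size would reflect every position being written out rather than a filtering problem.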

Any help would be much appreciated, 

Best wishes
Barney

Victor Soria-Carrasco

Aug 12, 2013, 7:37:10 PM
to NGS...@googlegroups.com, Barney Wharam
Hi Barney,

I am sorry I don't have too much time to look into this this week, but I
have found this thread: http://seqanswers.com/forums/showthread.php?t=20061

In particular, the scripts available here:
http://users.ugent.be/~slvbelle/NGS/ seem to do exactly what you want.
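
To give a rough idea of the core step involved, here is a minimal sketch of classifying a single SNP as synonymous or non-synonymous. It is not taken from the scripts linked above. It assumes you have already used the annotation to extract the coding sequence of the gene (on the coding strand) and to convert the SNP position into a 0-based offset within that sequence; the function name classify_snp is made up for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_snp(cds, cds_pos, alt_base):
    """Classify a SNP inside a coding sequence.

    cds      -- coding sequence on the coding strand, starting at the ATG
    cds_pos  -- 0-based position of the SNP within cds (assumed precomputed)
    alt_base -- alternative base, given on the coding strand
    Returns (kind, ref_amino_acid, alt_amino_acid).
    """
    offset = cds_pos % 3                    # position within the codon
    start = cds_pos - offset                # first base of the affected codon
    ref_codon = cds[start:start + 3].upper()
    alt_codon = ref_codon[:offset] + alt_base.upper() + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    kind = "synonymous" if ref_aa == alt_aa else "non-synonymous"
    return kind, ref_aa, alt_aa
```

For example, with the toy sequence ATGGCTGAA, `classify_snp("ATGGCTGAA", 5, "C")` changes GCT to GCC, both of which encode alanine, so the change is synonymous. Genes on the reverse strand need the sequence and the alternative base reverse-complemented first, which this sketch leaves out.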

Cheers,

Victor

__
Victor Soria-Carrasco
Post-doctoral Research Associate
Nosil Lab of Evolutionary Biology
Department of Animal and Plant Sciences
University of Sheffield
Western Bank
Sheffield S10 2TN
United Kingdom
