Extracting frequency allele threshold information.

149 views
Skip to first unread message

Jane Wilson

unread,
Jun 2, 2015, 9:54:21 PM6/2/15
to igv-...@googlegroups.com

Hello,

  I was wondering if there is any way of extracting the information about mutations/deviations in the two pop-up windows in the screen shot below.  We are analysing viral genomes, and would like to plot the percentage per site in the genome of all deviations from the reference (insertions + deletions + nucleotide changes).

If you had any advice on how to extract this information, I’d be very grateful!  We are using Illumina HiSeq data, mapped to the reference using bowtie or BWA.

Thanks very much,

Jane


Jim Robinson

unread,
Jun 2, 2015, 9:58:14 PM6/2/15
to igv-...@googlegroups.com
Hi,

No, unfortunately there is not.  IGV is primarily a visualizer, we leave analysis to other tools (samtools, bedtools, Picard, ...).   One could argue this simple counting is not analysis, but you can't do it currently.  You can use igvtools to count nucleotides by position, but that does not get you all the way.

I suggest you look at samtools mpileup to see what it has, and maybe post your question more broadly in nextgen analysis forums.

Jim

--

---
You received this message because you are subscribed to the Google Groups "igv-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to igv-help+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/igv-help/a2b4d562-aeca-4aae-8a7d-80c6582be5e7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Jane Wilson

unread,
Jun 2, 2015, 11:06:10 PM6/2/15
to igv-...@googlegroups.com
Hi Jim,
  thanks very much for your reply.  I'll look into mpileup.
Thanks again,
Jane

Rafael Kroon Campos

unread,
Jun 8, 2015, 11:58:37 PM6/8/15
to igv-...@googlegroups.com
Hello,

I have a similar question, but I just want the count per nucleotide. When I hover over my data with the mouse in IGV I can see the total count for each nucleotide in the reference sequence. Is it possible to export this data in a list format to excel so I could have the total count in one column and each nucleotide of the reference sequence in another? The viral genome I am looking at is 11000 bases long.


For example:

A- total count= 300
T-total count= 500
G- total count =500
A= total count =550

 thank you,

Rafael Campos 

Jim Robinson

unread,
Jun 9, 2015, 12:28:31 AM6/9/15
to igv-...@googlegroups.com
Hi,

You can do this with igvtools from the command line, see the user documentation for the "count" command. 
--

---
You received this message because you are subscribed to the Google Groups "igv-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to igv-help+u...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages