Uploading *.vcf data into IGV for the first time

543 views
Skip to first unread message

Lili Peng

unread,
Apr 21, 2016, 1:30:17 PM4/21/16
to igv-help
Hi,

I am trying to load the data into the 64-bit Java version of IGV but get the following message:

"An index file for ~\ExAC.r0.3.1.sites.vep.vcf could not be located. An index is required to view files of this size.   Click "Go" to create one now."

The *.vcf file is ~33GB, by the way.

So, I click 'Go' but IGV has been churning away at it for the past 7 min.

Any advice for a newbie would be greatly appreciated,
lili




James Robinson

unread,
Apr 21, 2016, 2:01:03 PM4/21/16
to igv-...@googlegroups.com
It might take a few hours to index a 33GB file.  I highly recommend you use tabix to do the indexing rather than IGV,  it will also compress it.  No guarantees you’ll be able to load a VCF that size even after indexing.

Jim


--

---
You received this message because you are subscribed to the Google Groups "igv-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to igv-help+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/igv-help/a5997544-c036-4197-83f3-de7182c9d93a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Lili Peng

unread,
Apr 21, 2016, 3:46:37 PM4/21/16
to igv-help
Thanks for the advice on using tabix, as I've already downloaded the program onto my UNIX environment.

Would you know if there are any examples of indexing *.vcf genomics data using tabix?  The manual (http://www.htslib.org/doc/tabix.html) provides only a few examples.  It would be really helpful if I were to see more examples, especially complex and descriptive ones.

Thanks again,
Lili

Jim Robinson

unread,
Apr 21, 2016, 9:14:30 PM4/21/16
to igv-...@googlegroups.com
I don't have anything to add beyond what's in the manual, which you've already found.   There's nothing really complex about it.  The VCF should be sorted by start position,  then bgzip it and run tabix on the bgzipped file.


--

---
You received this message because you are subscribed to the Google Groups "igv-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to igv-help+u...@googlegroups.com.

Lili Peng

unread,
Apr 21, 2016, 10:39:00 PM4/21/16
to igv-help
I'd used tabix to generate a *.tbi file, but it's still unclear to me how to index the *.gz file using the *.tbi file with tabix.  Which is why I'd asked if there are more detailed examples.  I've been searching UCSC and other websites for specific steps, but each is all written involving a tool/web app specific to that website.

Jim Robinson

unread,
Apr 21, 2016, 11:01:42 PM4/21/16
to igv-...@googlegroups.com
Hi,  I don't understand your question,  the .tbi file is the index.  If you have it just load the vcf.gz file into IGV or any other program that supports tabix indexes.   Keep the vcf.gz and vcf.gz.tbi files together in the same directory.
Reply all
Reply to author
Forward
0 new messages