whitespace in INFO field

321 views
Skip to first unread message

Will C

unread,
Aug 7, 2015, 8:58:50 AM8/7/15
to igv-help
I get this error below a lot, and it then displays nothing on the track. Other programs like Annovar skip this problem and report variants. Is there a way that IGV can accommodate this soft error and still display track? I think it would take me a long time to manually edit out the whitespace in this whole genome file. (using IGV version 2.3.40)

Error loading features for interval: chr11:88909039-89030927
htsjdk.tribble.TribbleException: The provided VCF file is malformed at approximately line number 14082: The VCF specification does not allow for whitespace in the INFO field. Offending field value was "VARTYPE_SNV;hgmd_alleles=G/A;hgmd_id=CM002936;hgmd_disease=Papillon-Lefevre syndrome;hgmd_gene=CTSC;AA=G;EUR_AF=0.85;AMR_AF=0.91;AF=0.91;AFR_AF=0.91;ASN_AF=1.00;CSQ=G|ENSG00000109861|ENST00000393302|Transcript|non_synonymous_codon|407|407|136|I/T|aTa/aCa||benign(0)|tolerated(0.61)||CTSC|||||neutral(0.004),G|ENSG00000109861|ENST00000527018|Transcript|non_synonymous_codon|328|329|110|I/T|aTa/aCa||benign(0)|tolerated(0.64)||CTSC|||||neutral(0.003),G|ENSG00000109861|ENST00000227266|Transcript|non_synonymous_codon|573|458|153|I/T|aTa/aCa||benign(0)|tolerated(0.65)|YES|CTSC|||||neutral(0.003),G|ENSG00000109861|ENST00000533865|Transcript|nc_transcript_variant|480|||||||||CTSC|||||", for input source: /Users/williamcassano/Desktop/BRIAN GENOMICS/Variations/PG0000652-BLD.snps.vcf.gz

-Will

Jim Robinson

unread,
Aug 7, 2015, 9:20:58 AM8/7/15
to igv-...@googlegroups.com
Try updating to the latest igv,  it has a new version of the htsjdk.
--

---
You received this message because you are subscribed to the Google Groups "igv-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to igv-help+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/igv-help/141e4d98-c54f-4f3e-96c5-050818aa3c10%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Will C

unread,
Aug 8, 2015, 3:04:39 PM8/8/15
to igv-help
I get the same message with IGV 2.3.59 app version for Mac that I just downloaded.

Jim Robinson

unread,
Aug 8, 2015, 8:56:58 PM8/8/15
to igv-...@googlegroups.com
We use the htsjdk for VCF parsing and its rather strict on the spec.
I'm not sure what can be done, but if you can prepare a small example
file with the problem I'll see if I can create a workaround. This
might also be a topic to raise on the htsjdk forum.

Jim

Will C

unread,
Aug 9, 2015, 8:52:18 AM8/9/15
to igv-help
Looks like spaces appear in disease names instead of underscores or dashes. This file was created by Casava in Fall 2012 (says it's VCFv4.1) and worked fine with IGV back then. Is the no whitespace rule a recent addition? Maybe I should just convert every space character to an underscore character.

Jim Robinson

unread,
Aug 9, 2015, 9:47:32 AM8/9/15
to igv-...@googlegroups.com
Hi Will,

We use the htsjdk (https://samtools.github.io/htsjdk/) for parsing
VCF, so I really can't tell you if this is a recent change or not.
You could ask on their mailing list, include a few lines of the stack
trace and/or the error message.

Jim
Reply all
Reply to author
Forward
0 new messages