VCF generated by VCFtools (v0.1.13) not readable in TASSEL 5

265 views
Skip to first unread message

Jessica May

unread,
Nov 20, 2014, 1:49:12 AM11/20/14
to tas...@googlegroups.com
Hi All,
I am using TASSEL5 latest build. Standalone downloaded from:
https://bitbucket.org/tasseladmin/tassel-5-standalone/downloads#tag-downloads
I am trying to load VCF File generated by  VCFtools (v0.1.13) after filtering.
It says error in number of sites for some texa.
I believe everything is fine as PLINK and VCFtools are able to read file.

Any idea?

regards
May

Terry Casstevens

unread,
Nov 20, 2014, 12:01:33 PM11/20/14
to Tassel User Group
Hi Jessica,

Does you file have values ".:.:.:."? We have an open issue regarding
that. I think that should be imported as unknown values. Do you know
if that is true?

You can send me your file privately if you want.

Best,

Terry
> --
> You received this message because you are subscribed to the Google Groups
> "TASSEL - Trait Analysis by Association, Evolution and Linkage" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tassel+un...@googlegroups.com.
> To post to this group, send email to tas...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tassel/CALBratjNW5tTg94PtQzyx7Vsw9z8-vzqxzWcX-_qFaEUFSbyzQ%40mail.gmail.com.
> For more options, visit https://groups.google.com/d/optout.

Jessica May

unread,
Nov 20, 2014, 12:41:11 PM11/20/14
to tas...@googlegroups.com
Thanks Terry,
I have not peep in to file.
The error what i am getting is that a particular row do not have same number of sites what are expected.
i will try to subset file and send you as it is very big.

regards
May


Ginnie M

unread,
Dec 18, 2014, 4:13:35 PM12/18/14
to tas...@googlegroups.com
Hi,
I'm having a similar issue. I'm using TASSEL5 to convert a HDF5 file into VCF format for use downstream (in Beagle4.0). While the conversion seems to work, Beagle does not approve of TASSEL's notation for deletions:
Exception in thread "main" java.lang.IllegalArgumentException: ALT allele [-] is not a sequence of A, C, T, G, N, or '*' characters
VCF seems to want 'DB' for a deletion (though I'm not sure if Beagle will be happy with this). Is there a line I could change in TASSEL5 to fix/test this?

Terry Casstevens

unread,
Dec 18, 2014, 4:28:31 PM12/18/14
to Tassel User Group
Are you saying Tassel puts - for missing data?

What does VCF consider missing? N, *, DB?

What does Beagle use?
> https://groups.google.com/d/msgid/tassel/3814e12b-2550-47e4-8eab-45b8debcb70c%40googlegroups.com.

Ginnie M

unread,
Dec 18, 2014, 4:47:19 PM12/18/14
to tas...@googlegroups.com
Tassel seems to be putting a '-' for what the TASSEL gui codes as '-'/deletion, not missing. VCF and Beagle seem to want, I think, a '.' for the alternate allele when there's a single base deletion. I'm not entirely clear on this--this is just what I'm piecing together (possibly incorrectly).

You received this message because you are subscribed to a topic in the Google Groups "TASSEL - Trait Analysis by Association, Evolution and Linkage" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tassel/hd1aTRxz3fE/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tassel+un...@googlegroups.com.

To post to this group, send email to tas...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages