exporting dosage in VCF file

131 views
Skip to first unread message

Sarah Medland

unread,
Jul 21, 2017, 3:58:23 AM7/21/17
to plink2-users
Hi

I'm testing out the vcf export options to generate minimac3 style vcf files using  "--export vcf vcf-dosage=DS "

The output is patchy - most snp*individual entries have GT only some have GT:DS and some have DS only (screen shot of a section of data below showing the various combinations)
This output will not be readable by downstream programs as a minimac3 style vcf.

Any suggestions on how to obtain consistent GT:DS data from plink2?

thanks
Sarah



Auto Generated Inline Image 1

Christopher Chang

unread,
Jul 21, 2017, 11:04:14 AM7/21/17
to plink2-users
Hmm, the current output should conform to the VCF spec?  (Omitted field = missing value, and DS is omitted if and only if it's an integer that can be inferred exactly from GT.)

With that said, I realize that some minimac3 VCF readers may not be able to handle this.  It would substantially increase the size of the output file, but I can add an option this weekend to dump redundant DS values.

(With that said, the current output should conform to the VCF spec?)

Sarah Medland

unread,
Jul 22, 2017, 4:15:56 AM7/22/17
to plink2-users
That would be great - I realise it is going to increase size a lot but there are a number of tools that can't deal with this.

If you are willing to consider adding another option, the ability to output a vcf that contains DS but not GT is also readable by many programs and would help keep the size down.

best wishes
Sarah

Christopher Chang

unread,
Jul 23, 2017, 9:48:29 PM7/23/17
to plink2-users
vcf-dosage=DS-force is now implemented; let me know if you run into any problems with it.

(I don't plan to add any sort of omit-GT option, since that creates problems on chrX...)

Sarah Medland

unread,
Jul 24, 2017, 9:43:14 PM7/24/17
to plink2-users
Wonderful - thanks!

Reply all
Reply to author
Forward
0 new messages