Hi,
We have run platypus on a trio and are having problems with certain variants the output. Here is an example: two lines from the output VCF:
1 804107 . TCCCTAGAGACAA CCACTAGAAACAT 367 PASS BRF=0.26;FR=0.1667;HP=3;HapScore=2;MGOF=47;MMLQ=41;MQ=51.56;NF=4;NR=0;PP=3
67;QD=110.117;SC=CAGAACACAATCCCTAGAGAC;SbPval=1;Source=Platypus;TC=76;TCF=48;TCR=28;TR=4;WE=804127;WS=804097 GT:GL:GOF:GQ:NR:NV 0/0:0,-9.03,
-300:8:90:28:0 0/0:0,-8.13,-300:7:81:28:0 1/0:-30.2,0,-151.88:47:99:20:4
1 804115 . G A 2965 PASS BRF=0.26;FR=0.8333;HP=2;HapScore=2;MGOF=69;MMLQ=37;MQ=50.49;NF=44;NR=24;PP=2965;QD=20;SC=AATCCCTAGAGACAACCTACC;SbPval=0.51;Source=Platypus;TC=69;TCF=45;TCR=24;TR=68;WE=804127;WS=804097 GT:GL:GOF:GQ:NR:NV 1/1:-102.8,-7.8,0:8:78:26:25 1/1:-102.4,-7.77,0:7:78:27:27 1/1:-63.9,-4.52,0:69:45:16:16
So the first variant is a MNP that overlaps the SNP in the second line. We are trying to decompose and normalise the MNPs so that we can annotate them against other data sources. Therefore we would like to express this MNP as 4 SNPs: 1:804107T>C 1:804109C>A 1:804115G>A (which is the same SNP as in the second line above) and 1:804119A>T. The problem is that the first line says that individuals 1 and 2 are homozygous reference and individual 3 is heterozygous for 1:804115G>A, but the second line says all individuals are homozygous for the same SNP.
Has anyone else come across this? Is there a simple solution?
Many thanks, Keren