Hello,
I have used platypus to call genotypes in Genome In a Bottle sample HG001/NA12878 and I get the following line in my output:
1 8446082 . ATTTTTTTT A,ATTTTTTTTTTT,TTTTTTTTT 985 SC;HapScore;badReads BRF=0.0;FR=0.5000,0.4989,0.5000;HP=21;HapScore=12;MGOF=20;MMLQ=7;MQ=70.0;NF=4,4,1;NR=7,7,3;PP=363,23,985;QD=33.2727272727;SC=ATATATATATATTTTTTTTTT;SbPval=0.54;Source=Platypus;TC=12;TCF=4;TCR=8;TR=11,11,4;WE=8446102;WS=8446072 GT:GL:GOF:GQ:NR:NV 1/2:-1,-1,-1:20:4:12,12,15:11,11,4
Given that the genotype indicates that only alternate alleles 1 and 2 are present, I'm not sure why platypus returns a third alternate allele. Any help interpreting this output or identifying it as bug would be appreciated. Thanks!
Best,
Katie
--
You received this message because you are subscribed to the Google Groups "Platypus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to platypus-users+unsubscribe@googlegroups.com.
To post to this group, send email to platypus-users@googlegroups.com.
Visit this group at https://groups.google.com/group/platypus-users.
To view this discussion on the web, visit https://groups.google.com/d/msgid/platypus-users/ea10bd2d-9610-4fd8-bc24-d97826311f98%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi Katherine,Platypus will list all the alleles that have more than a certain amount of support in the data (default is minimum of 2 supporting reads and phred-scaled posterior > 5), which can lead to > 2 alternate alleles being listed. This is not a bug. It looks like this region contains a long T homopolymer, which will probably lead to a large number of sequencing indel errors, on top of whatever actual variation exists. Calling indels correctly in these regions is difficult and error-prone, but the best way to interpret this is to use the called genotype and ignore anything else. So in this case, Platypus has called the first 2 alternate alleles (the 1/2 genotype), and the 3rd should be ignored.Kind regards,
Andy
On Fri, Mar 31, 2017 at 6:41 PM, Katherine Wilkins <kawilkin...@gmail.com> wrote:
Hello,
I have used platypus to call genotypes in Genome In a Bottle sample HG001/NA12878 and I get the following line in my output:
1 8446082 . ATTTTTTTT A,ATTTTTTTTTTT,TTTTTTTTT 985 SC;HapScore;badReads BRF=0.0;FR=0.5000,0.4989,0.5000;HP=21;HapScore=12;MGOF=20;MMLQ=7;MQ=70.0;NF=4,4,1;NR=7,7,3;PP=363,23,985;QD=33.2727272727;SC=ATATATATATATTTTTTTTTT;SbPval=0.54;Source=Platypus;TC=12;TCF=4;TCR=8;TR=11,11,4;WE=8446102;WS=8446072 GT:GL:GOF:GQ:NR:NV 1/2:-1,-1,-1:20:4:12,12,15:11,11,4Given that the genotype indicates that only alternate alleles 1 and 2 are present, I'm not sure why platypus returns a third alternate allele. Any help interpreting this output or identifying it as bug would be appreciated. Thanks!
Best,
Katie
--
You received this message because you are subscribed to the Google Groups "Platypus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to platypus-user...@googlegroups.com.
To post to this group, send email to platypu...@googlegroups.com.
Visit this group at https://groups.google.com/group/platypus-users.
To view this discussion on the web, visit https://groups.google.com/d/msgid/platypus-users/ea10bd2d-9610-4fd8-bc24-d97826311f98%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.