Hi all
There are two exons, 5617 <- 5737 and 5793 <- 5953 with phases 0 and 2, respectively. Both exons are in the reverse strand.
From the spec, phase indicates "the number of bases that should be removed from the beginning of this feature to reach the first base of the next codon", and for "reverse strand features, phase is counted from the end field".
In the case of the 3' exon (5793 <- 5953), the end field (the 5th column) is 5953.
The base at the end field is the first base of the translated CDS, so there should be no bases removed "to reach the first base of the next codon". I suggest that this phase should be 0, not 2.
The output gff3 is correct if "the number of bases that should be removed from the beginning of this feature to reach the first base of the next codon" is measured from the 'left-hand' end of this feature (the start field) rather than the end field.
Has anybody else ran into this problem or am I misreading the gff3 spec?
Rob Syme
PhD Student
Curtin University