Dear HGVS team,
I would like to submit a particular point of ambiguity in the HGVS nomenclature guidelines concerning insertions that encode a translation stop codon.
Example 1:
This insertion results in the addition of SerTer after Asp358. The translation stop codon is encoded in the insertion.
The Variant Validator, Mutalyzer and VEP tools represent this alteration as a frameshift:
p.(Gln359Serfs*2)
Example 2:
Modified from example 1 by adding one nucleotide:
Since the nucleotide is added after the encoded termination signal, it
does not impact the actual protein modification. However, the
insertion's total length is now a multiple of three, leading to its
classification as an insertion rather than a frameshift:
p.(Asp358_Gln359insSer*)
These two examples highlight how, for these prediction tools, the "multiple of three" rule plays a determinant role for the frameshift determination, while the presence of an encoded stop codon does not. In contrast, the HGVS guidelines do not involve the "multiple of three rule". Instead, they involve the open reading frame status of the inserted sequence.
The guidelines define frameshift as follows:
"Frameshift: a sequence change between the translation initiation
(start) and termination (stop) codon where, compared to a reference
sequence, translation shifts to another reading frame."
From the Frameshift, Insertions, Deletions-Insertions, and Duplications sections, it is stated that: inserted sequences on DNA or RNA level that:
Notably, the emphasis on the term “after” (in the text) suggests that
it is crucial for the classification that the sequence after the
insertion continue to be translated.
The two insertions reported above share the following characteristics:
Without involving the "multiple of three" rule, It is unclear whether NM_013976.5(GCDH):c.1074_1075insAGTTGAAGGAC should be represented as a frameshift p.(Gln359Serfs*2) or simply as an insertion p.(Asp358_Gln359insSer*).
Since none of the examples provided in the guidelines directly address
this scenario, I suggest adding the "non-multiple-of-three" case as a
new example in the Frameshift section. This would help for the
determination of similar situations, including Duplications, and
Deletion-Insertions, reducing subjective interpretation and ensuring
consistent application of the HGVS nomenclature.
Kind Regards,
David Hernandez