I have a question regarding HGVS RNA nomenclature for pseudoexon inclusion events.
To illustrate my question, I will use the variant NC_000012.12:g.102854533A>C NM_000277.3:c.706+603T>G reported in the paper DOI:10.1016/j.jmoldx.2023.02.001
This deep intronic variant results in the inclusion of a pseudoexon derived from intronic sequence approximately corresponding to c.706+534 to c.706+647. The inserted pseudoexon contains the causative variant itself, meaning that the sequence present in the RNA differs from the reference intronic sequence at position c.706+603.
After reading the HGVS recommendations and examples, I am still unsure how this situation should be described at the RNA level. My understanding is that the consequence could be written as:
r.706_707ins[706+534_c.706+647;706+603T>G]
because the inserted sequence contains the nucleotide substitution and is therefore not identical to the reference intronic sequence.
However, I am not sure whether sequence differences present within the inserted fragment should be incorporated directly into the RNA description or handled in another way.
Could someone clarify what the HGVS-compliant RNA description would be in this situation?
Thank you very much for your help.
Thank you very much for taking the time to explain the correct notation and for pointing me to the LOVD database. I had not noticed that the RNA nomenclature was already specified in there.
Best regards,
Obdulia