Dear Ketaki,
sorry to be slow with this reply; an answer is not that simple.
1) you do not give the reference sequences used: I assume you mean the variant NC_000019.9:g.(45409925_45411016)_(45411210_45411789)dup NM_000041.2:c.(43+1_44-1)_(236+1_237-1)dup. Note in your description "g." was missing.
2) the format you suggest, NC_000019.9:g.(45409925_45411016)_(45411210_45411789)[3], cannot be used in HGVS nomenclature. An essential element of the HGVS
nomenclature is that it describes the "differences compared to a reference sequence". The difference in the example is one additional copy, so going from 1 to 2 copies.
3) I agree
the description NC_000019.9:g.(45409925_45411016)_(45411210_45411789)dup, by using "dup", assumes the extra copy of the sequence is "
in tandem" with the original copy, a conclusion you cannot conclude when the break point has not been sequenced. The topic is listed on the Recommendations > DNA > duplication page (see
https://hgvs-nomenclature.org/stable/recommendations/DNA/duplication/), where the format
g.?_?ins[NC_000019.9:g.(45409925_45411016)_(45411210_45411789)
] is suggested as an alternative description. This format indicates that an extra copy was identified but that the location of the additional copy is not known and could be anywhere in the genome. Best regards,
Johan den Dunnen
HUGO HGVS Variant Nomenclature Committee (HVNC)