Hi HGVS group,
At Epic we are working on a project to add support for labs to report repeat expansion variants with inequality and ranges of repeats for a given nucleotide sequence present in the gene.
For example, running Southern Blot Analysis or other gel electrophoresis testing for Huntington’s (HTT gene) may indicate that there are at least 42 repeats of the CAG nucleotide sequence, would we represent this as NG_009378.1:g.3074_3076CAG[>42], NG_009378.1:g.3074_3076CAG[(>42)] or some other format?
Another tangentially related question I have is whether you’ve encountered questions about reporting variants that have some indeterminate number of repeats (ex: Instead of >42 the lab result indicates that the nucleotides are “expanded”)? This came up during our conversations with some labs and we’re trying to get a handle on what exactly would make sense for reporting a variant name against this type of result (would it be valid to report NG_009378.1:g.3074_3076CAG[(expanded)]?)
Lastly, if we document a range of values (ex: 8-12 repeats) is this the correct nomenclature? Based on what I’ve seen for general recommendations this seems correct, but I want to confirm.
NG_009378.1:g.3074_3076CAG[(8_12)]
Any additional insight or recommendations would be appreciated!
Best,
Steven
Steven Cordes
|
Software Developer