--
You received this message because you are subscribed to the Google Groups "hgvs-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hgvs-discuss+unsubscribe@googlegroups.com.
To post to this group, send email to hgvs-d...@googlegroups.com.
Visit this group at https://groups.google.com/group/hgvs-discuss.
To view this discussion on the web visit https://groups.google.com/d/msgid/hgvs-discuss/f4d08bde-cf99-4de2-95a0-c85f91a6b95f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi Somak-
Thank you for sending the file of purported errors in the hgvs package. I investigated your results and want to share my interpretations. My comments are in https://docs.google.com/spreadsheets/d/1y4f68Yg_cKNLtoAPYxmUqgRCWSPFxEZDVc2LBz1TTrI/edit#gid=57717990.
Your set contains 330 tests with 45 unique transcripts in 45 unique genes. There were 23 discrepancies (defined as “manual hgvs c.” not equal to “automated hgvs c.”) in the data set:
The typo was not included in the 22 reported discrepancies and I include it here only to help explain rows in the spreadsheet. The remaining discrepancies represent two distinct issues. In the code below, I use hgvs 1.1.0, the same version that was used in your report.
1) Regarding the raised exceptions, it appears that you have selected a transcript that is not within the bounds of the gene in question. Note that the transcript your team used for the TERT variants is not returned by the relevant_transcripts()
method (see below). When one tries to use a transcript that does not span a variant region, the full text of the resulting exception is “HGVSInvalidIntervalError: Position is beyond the bounds of transcript record”. Writing succinct error messages is hard and I welcome your suggestion if another error message would have made the issue clearer.
>>> hgvs.__version__
'1.1.0'
>>> var_g = hp.parse_hgvs_variant("NC_000005.9:g.1295250G>A")
>>> am37.relevant_transcripts(var_g)
[]
>>> am37.g_to_c(var_g, "NM_198253.2")
Traceback (most recent call last):
...
HGVSInvalidIntervalError: start or end or both are beyond the bounds of transcript record
2) Regarding the discrepancies in intronic variants, hgvs is operating correctly. Normalizing a variant requires having the context of the surrounding sequence. RefSeq transcripts do not contain intronic sequence, and therefore intronic variants on RefSeq transcripts cannot be normalized. The hgvs package warns that intronic variants are not normalized and that the unnormalized variant is returned. Please note that one should not assume a particular genome reference in place of the intronic sequence. Because different genome references (e.g., GRCh37 and GRCh38) may imply different intronic sequences, a variant might normalize differently on different assumed intronic sequences.
>>> hgvs.__version__
'1.1.0'
>>> var_g = hp.parse_hgvs_variant("NC_000017.10:g.29528504_29528505insT")
>>> am37.g_to_c(var_g, "NM_001042492.2")
WARNING:hgvs.assemblymapper:Normalization of intronic variants is not supported; returning unnormalized variant
SequenceVariant(ac=NM_001042492.2, type=c, posedit=1260+1_1260+2insT)
It is important to have other groups validate hgvs and I appreciate the time that your team took to do so. I do not yet have access to the manuscript and therefore do not know whether the above explanations were included in the Discussion. Either way, it seems to me that the abstract (pubmed) implies that hgvs generated incorrect variants when, in fact, both classes of discrepancies reflect expected behavior with a sound rationale. Furthermore, these apparent discrepancies were accompanied by exceptions and warnings that explained that behavior. If you agree with this assessment, please consider changes in proofs or attaching an addendum.
I welcome your reply and any suggestions that will improve the usability of hgvs.
Best wishes,
Reece
--
To unsubscribe from this group and stop receiving emails from it, send an email to hgvs-discuss...@googlegroups.com.
To post to this group, send email to hgvs-d...@googlegroups.com.
Visit this group at https://groups.google.com/group/hgvs-discuss.
To view this discussion on the web visit https://groups.google.com/d/msgid/hgvs-discuss/f4d08bde-cf99-4de2-95a0-c85f91a6b95f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "hgvs-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hgvs-discuss...@googlegroups.com.
To post to this group, send email to hgvs-d...@googlegroups.com.
Visit this group at https://groups.google.com/group/hgvs-discuss.
To view this discussion on the web visit https://groups.google.com/d/msgid/hgvs-discuss/CAKNDN-fuJWYzUzwEniK_SVrAx4h9ueM2PUeKgJnEQnuLuZbL7w%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/hgvs-discuss/CAKNDN-fuJWYzUzwEniK_SVrAx4h9ueM2PUeKgJnEQnuLuZbL7w%40mail.gmail.com.
To unsubscribe from this group and stop receiving emails from it, send an email to hgvs-discuss...@googlegroups.com.