liftover issue from hg38-to-hg19

366 views
Skip to first unread message

Yasmina Mekki

unread,
Oct 3, 2022, 5:52:49 PM10/3/22
to gen...@soe.ucsc.edu
Hello,

I am trying to lift my data from one Genome Reference to another (hg38 to hg19). However, I am facing something strange as for different variants, the new position (on hg19) is on a different chromosome than in hg38. Please see this link that illustrate an example:

Is this expected?

Thank you,
Yasmina


Luis Nassar

unread,
Oct 5, 2022, 7:11:37 PM10/5/22
to Yasmina Mekki, gen...@soe.ucsc.edu
Hello, Yasmina.

Thank you for your interest in the Genome Browser.

We would need to evaluate the oddities on a case by case basis, but the location of the variant you provided on hg38 (17:138824) is located on a completely new sequence that was added to hg38 and does not exist on hg19 (https://www.ncbi.nlm.nih.gov/nuccore/AC240565.4).

You can see the following session which shows the entire new contig in place (http://genome.ucsc.edu/s/Lou/RM30088). Looking at the self-chain track, which is what liftOver is built on, it looks like there is a fairly high quality sequence match between this new region and the location on hg19 chr11, which is why liftOver ultimately matches it there. That being said, it is an entirely novel region on hg38. Sometimes the GRCh moves a contig to a different location in assembly updates (even to different chromosomes), however when that happens it is annotated as such. That is not the case here, so it would appear there is no exact corresponding location of this variant on hg19.

If your variant has a dbSNP rsID it is always recommended you attempt to pick up the corresponding coordinates on another assembly using the ID and not liftOver (http://genome.ucsc.edu/FAQ/FAQreleases.html#snpConversion). That being said, after spot checking some variants located in this new contig I don't believe any have corresponding hg19 coordinates, likely due to it being novel sequence.

I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on our public forum. If your question includes sensitive information, you may send it instead to genom...@soe.ucsc.edu.

Lou Nassar
UCSC Genomics Institute

--

---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CAJ0W%3DAV5S_NPKbKdOBvAXcaovQg%3D5sJKwC2EUdf%3Di%3Dvf-aASKg%40mail.gmail.com.
Reply all
Reply to author
Forward
0 new messages