Hello,
Thank you for your interest in the UCSC Genome Browser and your question about hg19/hg38 regions.
The updated reference genome hg38 contains many improvements over the hg19 assembly, some of which may include contigs that were merged together or placed where there were previously sequence gaps. These updates can be viewed by turning on the assembly track, the 'hg19 diff' (or hg38 diff) track, the GRC Patches, and the GRC Incident tracks. For example, you can load the following session to view the region chr10:45,234,036-49,262,210 on hg38:
This area is full of changes and updates to the sequence. If you have any specific regions of interest, we may be able to offer additional information as to why they have changed.
The Genome Reference Consortium (GRC) publishes the assembly, including the updates and corrections. For example, the latest hg38 patch was GRCh38.p13, and the following blog post covers details on this patch release:
https://genomeref.blogspot.com/2019/03/grch38p13-has-been-released.html
Once a certain number of fixed sequences accumulate, the GRC would release a new and more accurate reference genome.
If you want to read more about the differences in the assemblies, there are plenty of thorough explanations online. Here is one from NCBI: https://www.ncbi.nlm.nih.gov/assembly/basics/
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on our public forum. If your question includes sensitive information, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CAMt%3DZPmT2Ke2-e4AkUWS%2B2RVjveALrVEsWuLdP4RnYVNiMCAYQ%40mail.gmail.com.
Hello Cenny,
Thank you for your interest in the UCSC Genome Browser and your follow-up question about the hg19/hg38 regions.
Yes, part of the hg19 chr22 EWSR1 gene was dropped in the hg38 assembly. In the image below or session, under the label “GRC Incident Database”, we can see an annotation meaning that there has been a reported and resolved assembly problem, HG-346, in the region of the EWSR1 gene. Under the label “Contigs Dropped or Changed from GRCh37(hg19) to GRCh38(hg38)”, we can see a brown annotation in the region of the EWSR1 gene, which means different portions of this same contig used in the construction of hg38 and hg19 assemblies.
http://genome.ucsc.edu/s/gperez2/hg19_contigs_hg38_MLQ
We don’t see annotations for the GRC Incident track or the Hg38 Diff track in the hg19 chr11 FLI gene. Yes, the FLI gene should not have differences in the hg38 assembly.
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on our public forum. If your question includes sensitive information, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute