Dear Elisheva,
Thank you for using the UCSC Genome Browser and your question about the differences between hg19 RefSeq exons and hg38 RefSeq exons.
The short answer is that hg19 does not include a kind of transcripts called predicted transcripts so it has many fewer annotations. These predicted transcripts start with the letter X and the following MySQL query (name not like 'X%';" ) on the hg38 and hg19 databases will show a more expected number of transcripts for apples-to-apples comparisons by excluding these annotations:
A better choice over ncbiRefSeq would be ncbiRefSeqCurated that in essence performs the above exclusion of X-named item when the table was built (note the query does not include name not like 'X%';" ). The ncbiRefSeqCurated set includes only those annotations whose accessions begin with NM, NR, NP or YP (NP and YP are used only for protein-coding genes on the mitochondrion; YP is used for human only).
Note that for hg19 ncbiRefSeq and ncbiRefSeqCurated are similar since predicted items are not included in ncbiRefSeq.
Part of the explanation for why hg19 does not include these items is that XM_ and XR_ annotations are predicted by Gnomon (a gene predictor https://www.ncbi.nlm.nih.gov/genome/annotation_euk/gnomon/) and for hg19 it was shared that NCBI can not run the process without the EST and mRNA alignments and all the other inputs Gnomon requires.
Thank you again for your inquiry and for using the UCSC Genome Browser. If you have any further public questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
All the best,
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CALfxY1B7FtQpJb4YT0Cd8CeWCiXVYjQUsdfF01BCAS%2BEQ9R_iQ%40mail.gmail.com.
Matthew Speir
UCSC Cell Browser, Quality Assurance and Data Wrangler
Human Cell Atlas, User Experience Researcher
UCSC Genome Browser, User Support
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CALfxY1CC0a9N%3DWoaJ4hLhpSfTOk0eKLtiVOr4ybLcBSjcJ41wA%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CALfxY1Aoyj16R1E6J31nZCCrfBxL7fQd%2BCK6NQnXWDpcwEhipQ%40mail.gmail.com.