We are pleased to announce the release of five new GENCODE Gene tracks corresponding to GENCODE release V43 for human and VM32 for mouse. While all of the tracks are built from the GENCODE release, they fall into two categories. Two of these tracks, GENCODE V43 (hg38) and GENCODE VM32 (mm39), were built with our knownGene pipeline and are now the default gene tracks for those assemblies. The knownGene pipeline builds extensive associations from the annotations, which allows us to show additional metadata for each item and to link to external resources. The track description pages for these tracks contain options for configuring the display, such as showing non-coding genes, splice variants, and pseudogenes. Different tags and labels may also be toggled.
The remaining three tracks are nested within our GENCODE Versions superTrack, one for each of the three assemblies: hg19, hg38, and mm39. For human, the GENCODE V43 annotations were mapped to hg38 and then back-mapped to the hg19 assembly. New GENCODE releases now assign a rank to each transcript within its gene. The transcript rank may be used to filter the number of transcripts displayed in a principled manner. More details about transcript ranking can be found on the track description page. For all three assemblies, the gene sets contain the following tracks:
The hg38 and mm39 assemblies also include the following track:
Below is a summary of the contents found in each release. For more details, visit the GENCODE site.
GENCODE V43 Release Stats

Genes | Observed | Transcripts | Observed |
---|---|---|---|
Protein-coding genes | 19,393 | Protein-coding transcripts | 89,411 |
Long non-coding RNA genes | 19,928 | - full length protein-coding | 64,004 |
Small non-coding RNA genes | 7,566 | - partial length protein-coding | 25,407 |
Pseudogenes | 14,737 | Nonsense mediated decay transcripts | 21,354 |
Immunoglobulin/T-cell receptor gene segments | 410 | Long non-coding RNA loci transcripts | 58,023 |
Total number of distinct translations | 65,519 | Genes that have more than one distinct translation | 13,618 |
GENCODE VM32 Release Stats

Genes | Observed | Transcripts | Observed |
---|---|---|---|
Protein-coding genes | 21,565 | Protein-coding transcripts | 58,913 |
Long non-coding RNA genes | 14,834 | - full length protein-coding | 45,219 |
Small non-coding RNA genes | 6,105 | - partial length protein-coding | 13,694 |
Pseudogenes | 13,722 | Nonsense mediated decay transcripts | 7,211 |
Immunoglobulin/T-cell receptor gene segments | 701 | Long non-coding RNA loci transcripts | 26,421 |
Total number of distinct translations | 45,163 | Genes that have more than one distinct translation | 10,914 |
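
As a quick way to read the transcript columns, the full-length and partial-length rows appear to be a breakdown of the protein-coding transcript total in each release. The short Python sketch below checks that arithmetic using only counts copied from the two tables. The dictionary keys and the function name are illustrative choices for this example and are not part of any GENCODE or Genome Browser interface.

```python
# Counts copied from the two release tables above.
# The key names are hypothetical labels chosen for this sketch.
releases = {
    "V43 (human)": {
        "protein_coding": 89_411,
        "full_length": 64_004,
        "partial_length": 25_407,
    },
    "VM32 (mouse)": {
        "protein_coding": 58_913,
        "full_length": 45_219,
        "partial_length": 13_694,
    },
}


def check_protein_coding_split(stats):
    """Return True when full-length plus partial-length equals the protein-coding total."""
    return stats["full_length"] + stats["partial_length"] == stats["protein_coding"]


# Evaluate the check for each release and report the result.
results = {name: check_protein_coding_split(stats) for name, stats in releases.items()}
for name, ok in results.items():
    print(f"{name}: {'consistent' if ok else 'mismatch'}")
```

The same pattern could be extended to other rows if a breakdown is expected to sum to a total, though the tables here only make that relationship explicit for protein-coding transcripts.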
We would like to thank the GENCODE project for providing these annotations.