Hi Kristine,
Thank you for your question about finding the exon only length
of a gene. It is unclear from the link you shared what exactly
you want to find the length of, but the 51,320 number you are
seeing is the size of the genome (in base pairs) that you are
currently viewing in the Genome Browser window, not the length
of an individual gene.
That said, you can still use the Genome Browser to find the total
length of the exons in an individual gene using the Table Browser
tool: http://genome.ucsc.edu/cgi-bin/hgTables.
From the Table Browser page, make the following selections:
clade: Mammal
genome: Human
assembly: hg38
group: Genes and Gene prediction Tracks
track: GENCODE V22
table: knownGene
identifiers: paste or upload a list of gene names or ID's
output format: selected fields from primary and related tables
Then click "get output". From the get output page, in the section
"Select fields from hg38.knownGene" choose exonStarts, and
exonEnds. In the "hg38.kgXref fields" section, choose "geneSymbol"
and then click "get output".
For each exonStart and exonEnds pair, take the difference of
exonEnds-exonStart and sum them, this will give you the total
exon length.
Here is a similar previously answered mailing list question with an
example: https://groups.google.com/a/soe.ucsc.edu/d/msg/genome/l4Roei7mdxg/ljAaFlsXcR0J
For more information on the table browser please see the Table
Browser User's Guide at:
https://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html
You can also search our archived mailing list at
https://groups.google.com/a/soe.ucsc.edu/forum/#!forum/genome
to see similar questions that have been asked previously.
Thank you again for your inquiry and using the UCSC Genome Browser.
If you have any further questions, please reply to gen...@soe.ucsc.edu.
All messages sent to that address are archived on a publicly-accessible
forum. If your question includes sensitive data, you may send it instead
to genom...@soe.ucsc.edu.
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser discussion list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
Hi Kristine,
Thank you for your question about finding a list of 5000 mRNA sequences lengths.
If you have a list of UC ID's like the one from your example: uc001tar.3, then
you can use the Table Browser to select the mRNA length from the kgTargetAli table.
Note that if you have a mix of ID's, some from hg19 and some from hg38, then you
will have to perform two separate queries:
1. Head to the table browser http://genome.ucsc.edu/cgi-bin/hgTables
2. Make the following selections:
clade: Mammal
genome: Human
assembly: hg38, or hg19
group: All Tables
table: knownGene
identifiers: paste or upload your list of 5000 ID's
output format: selected fields from primary and related tables
3. Click "get output".
4. In the "Select Fields from db.knownGene" section, where db is either hg38 or hg19, check the checkbox "name".
5. In the "linked tables" section, check the checkbox of the row containing "hg38", "kgTargetAli", "Summary info about a patSpace alignment".
6. Click "allow selections from checked tables".
7. In the "db.kgTargetAli fields" section, check the checkbox qSize.
8. Click "get output".
You will now have a list of the form:
#hg38.knownGene.name hg38.kgTargetAli.qSize uc001xjt.4 5351 uc059cqu.1 565
Thank you again for your inquiry and using the UCSC Genome Browser.
If you have any further questions, please reply to gen...@soe.ucsc.edu.
All messages sent to that address are archived on a publicly-accessible
forum. If your question includes sensitive data, you may send it instead
to genom...@soe.ucsc.edu.