extracting the mRNA sequences of the canonical isoforms

Bogdan Tanasa

Dec 1, 2020, 11:57:39 AM
to gen...@soe.ucsc.edu
Dear all, 

Would you please advise on the most reliable way to extract the mRNA sequences of the canonical RefSeq genes in the human or mouse genome?

thanks a lot, 

bogdan

Luis Nassar

Dec 4, 2020, 6:23:45 PM
to Bogdan Tanasa, gen...@soe.ucsc.edu

Hello, Bogdan.

Thank you for your continued interest in the Genome Browser.

For both mm10 and hg38 you can use the Table Browser (http://genome.ucsc.edu/cgi-bin/hgTables) with the following selections:

group: Genes and Gene Predictions
track: GENCODE VM23 (mouse) or GENCODE V32 (human)
table: knownCanonical

Then set:

output format: sequence

This extracts the mRNA sequence of the canonical isoform for each gene. Note that if you are using RefSeq, you can use the RefSeq Select+MANE track (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&c=chrX&g=refSeqComposite), which lists the canonical transcripts released by NCBI. You can follow the steps above to extract the sequences, or get the raw files directly from NCBI (https://www.ncbi.nlm.nih.gov/refseq/refseq_select/).
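If you would rather assemble the sequences yourself from downloaded tables, the idea can be sketched roughly as below. This is only an illustration under stated assumptions: the records are toy data that borrow genePred-style field names (name, chrom, strand, exonStarts, exonEnds) with 0-based, half-open coordinates, the set of canonical IDs stands in for the transcript column of knownCanonical, and the helper names (reverse_complement, spliced_mrna, canonical_mrnas) are hypothetical, not part of any UCSC tool.

```python
# Illustrative sketch: toy data and hypothetical helper names, not UCSC code.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def spliced_mrna(chrom_seq, exon_starts, exon_ends, strand):
    """Join exon slices and orient the result to the transcript strand.

    Coordinates are assumed 0-based, half-open, in ascending genomic order.
    """
    exons = "".join(chrom_seq[s:e] for s, e in zip(exon_starts, exon_ends))
    return reverse_complement(exons) if strand == "-" else exons


def canonical_mrnas(transcripts, canonical_ids, chrom_seqs):
    """Map each canonical transcript name to its spliced sequence."""
    return {
        tx["name"]: spliced_mrna(
            chrom_seqs[tx["chrom"]], tx["exonStarts"], tx["exonEnds"], tx["strand"]
        )
        for tx in transcripts
        if tx["name"] in canonical_ids
    }


# Toy inputs (assumptions for illustration only).
toy_chroms = {"chrToy": "AAACCCGGGTTTACGT"}
toy_transcripts = [
    {"name": "tx1", "chrom": "chrToy", "strand": "+",
     "exonStarts": [3, 9], "exonEnds": [6, 12]},
    {"name": "tx2", "chrom": "chrToy", "strand": "-",
     "exonStarts": [3, 9], "exonEnds": [6, 12]},
    {"name": "tx3", "chrom": "chrToy", "strand": "+",
     "exonStarts": [0], "exonEnds": [3]},
]
toy_canonical_ids = {"tx1", "tx2"}  # tx3 is treated as non-canonical

mrnas = canonical_mrnas(toy_transcripts, toy_canonical_ids, toy_chroms)
for name, seq in mrnas.items():
    print(f">{name}\n{seq}")  # FASTA-style output
```

The sequences are written in the DNA alphabet (T rather than U). For real data the Table Browser route above is simpler, since it handles the coordinate lookup and strand orientation for you.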

Secondary structure information can be found in the UniProt track (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&g=uniprot), specifically the Structure subtrack which contains primary and secondary structure annotations. These annotations are available for both hg38 and mm10.

I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on our public forum. If your question includes sensitive information, you may send it instead to genom...@soe.ucsc.edu.

Lou Nassar
UCSC Genomics Institute



Bogdan Tanasa

Dec 4, 2020, 7:27:21 PM
to Luis Nassar, gen...@soe.ucsc.edu
Dear Luis, 

Many, many thanks for your email, the valuable information, and the wonderful help. Have a good weekend!
With big appreciation,

-- bogdan