downloading promoter sequence for string search

4 views
Skip to first unread message

Jessilyn Dunn

unread,
Sep 12, 2014, 11:06:33 AM9/12/14
to gen...@soe.ucsc.edu

Hello,

I am trying to use the UCSC browser to download the DNA sequence for gene promoters (+/-1kb from TxStart).

To do this, I planned to download the sequences using the table browser (RefSeq genes track) by defining +/-1kb from the TxStart using the "region," but I do not see a way to define this. Alternatively I created a "position" file in bed format containing the genomic coordinates of all promoters, but it has ~34,000 lines and it appears the limit for user-defined regions is 1,000. There must be a better way to do this, but I'm not sure how.

My overall goal is to get all of the gene accession #s for genes with a specific transcription factor binding sequence in their promoter. I also see that there is a potentially a method under "filter constraints" where I could do a string search within a sequence set, rather than exporting the sequences and writing code to do this.

Any insight you can provide on better methods would be greatly appreciated!
Thank you very much!
Sincerely,
Jessilyn


-- 
Jessilyn Dunn
NSF Graduate Research Fellow
Jo Lab of Vascular Mechanobiology and Disease
Dept. of Biomedical Engineering
Georgia Institute of Technology & Emory University
Reply all
Reply to author
Forward
0 new messages