Hello Paula,
Thank you for using the UCSC Genome Browser and for submitting your question regarding the possibility of downloading the "Restriction Enzymes" Mapping and Sequencing track (from REBASE). The actual mappings of the sequences to the genome are generated "on the fly," and therefore these data are not downloadable from the Table Browser. You can generate these coordinates for whole-genome or whole-chromosome yourself by following Option 2 below.
Option 1. View in the Genome Browser & export a coordinate range within a chromosome
Note: This option provides output limited to the genomic range you have selected in the browser.
Using the Genome Browser, you can select a coordinate range to view enzymes in the "Restr Enzymes" track under the "Mapping and Sequencing" track group. Be aware that zooming out provides fewer viewable enzymes (see the track description page for details). Once your region is selected, you can click into the enzyme on the browser graphic and then choose an option to download a BED formatted file of genomic coordinates for either all enzymes in the range, or you can filter for one or more enzymes in the range.
This video explains how to get BED file output from the browser for restriction enzymes:
Example rows from BED file:
Option 2. Use the command line utility "oligoMatch" to find perfect sequence matches
oligoMatch is a command-line utility which will find any sequence (e.g., a restriction enzyme (RE) cutting site) and match it to sequence. For example, if your RE cutting site sequence is "GATC" then oligoMatch will find that exact sequence, searching both strands by default. To do this, you will need the following:
- The oligioMatch utility
- Your RE sequence (e.g., GATC)
- Your reference sequence
Step A. Obtain your RE sequence
1. Set the following fields:
clade: Mammal
genome: Human
assembly: Select whichever human assembly you would like. I will use GRCh37/hg19 for this example.
group: All Tables
database: hgFixed
table: hgFixed.cutters Note: You can click on "describe table schema" to see the table description (enzyme cut site information).
filter: click "edit" and then
for the first row, set as name "does" match "DpnII" and click submit.
output format: selected fields from primary and related tables
output file: leave blank
2. Click "get output" to choose fields from the hgFixed.cutters table
Select the checkboxes for "name" and "seq".
3. Click "get output".
4. Example output:
For this example, we can call this file "RE.fa"
Step B: Obtain your reference sequence as the appropriate fie type ( .fa or.2bit). For this example, we will name this file "chr1.2bit".
Step C: Install the oligoMatch utility
Step D: Run the utility
For example: