Hi Aishwarya,
Thank you for using the UCSC Genome Browser and your question about identifying conserved binding sites of a protein between humans and rats.
Rather than the net files, you should likely look at the chain link table, hg19.rn4.all.chain.gz, which would identify the corresponding sequence in the other species. Here is a link with information about the chain format, http://genome.ucsc.edu/goldenPath/help/chain.html, and for comparison the page about the net format: http://genome.ucsc.edu/goldenPath/help/net.html
The chain format is deliberately quite dense where an initial header line starts a section followed by alignment data lines which represent the size of ungapped alignments and difference between block sizes. Here are some example lines from hg19.rn4.all.chain.gz:
chain 578869426 chr14 107349540 + 25151529 106332175 chr6 147636619 + 64051113 138454126 1
17 2 0
57 0 1
21 2 0
4 4 0
...
You could also examine the reciprocal best file for a more filtered answer. Ordinary liftOver chains are not necessarily equivalent in both directions, while the filtered reciprocal best chains are generated for that purpose. A reciprocal best chain exists for rn6:http://hgdownload.soe.ucsc.edu/goldenPath/hg19/vsRn6/reciprocalBest/ but not for rn4. Here are some example lines from hg19.rn6.rbest.chain.gz (note new rn6 chr6 coordinates mapping from the same hg19 region on chr14):
chain 567867938 chr14 107349540 + 25151529 106332175 chr6 147991367 + 64895805 138251111 1
17 2 0
57 0 1
21 2 0
...
Please know we have mailing list archives you can search to find answers that might be of interest. For example some previous answers point users on how to build reciprocal best chains: https://groups.google.com/a/soe.ucsc.edu/forum/?hl=en&fromgroups#!searchin/genome/reciprocal$20best
Thank you again for your inquiry and using the UCSC Genome Browser. If you have any further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
All the best,
Brian Lee
UCSC Genomics Institute
--
Hi Aishwarya,
Thank you for your question about how to use the reciprocal best chains file.
Questions about how to process data fall outside the scope of this mailing list,
thus how you choose to use the rbest file is up to you.
We recommend Biostars: https://www.biostars.org/
to get help with more general questions like this one.
Thank you again for your inquiry and using the UCSC Genome Browser. If you
have any further questions, please reply to gen...@soe.ucsc.edu. All messages
sent to that address are archived on a publicly-accessible forum. If your
question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser discussion list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
Hi Aishwarya,
Thank you for your question about the correct command to find orthologs between
rat and humans. Your command is indeed correct. If you ever have questions about
what a Kent utility does, run the command with no arguments, like so:
$ liftOver
This will print a usage message explaining what the program does and its various
arguments and options.
Note that liftOver was designed to work between two different assemblies of the same
species, not across species, as in this case. Thus, although your command may
technically work, it may produce undesired results.
Thank you again for your inquiry and using the UCSC Genome Browser. If you have any
further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that
address are archived on a publicly-accessible forum. If your question includes sensitive
data, you may send it instead to genom...@soe.ucsc.edu.
--