Orthologs between hg19 and rn4

41 views
Skip to first unread message

Aishwarya Kulkarni

unread,
Apr 7, 2016, 12:45:10 PM4/7/16
to gen...@soe.ucsc.edu
Hi,
I am trying to identify the conserved binding sites of a protein between humans and rats and after reading lot of the discussion topics , I am wondering if using net files would be the best option i.e the one located here:
http://hgdownload.soe.ucsc.edu/goldenPath/hg19/vsRn4/ , if so how would I use it on the command prompt and what would be the best way to validate the orthologous (hence conserved) between humans and rats,
Thanks
Aishwarya

Brian Lee

unread,
Apr 18, 2016, 7:45:30 PM4/18/16
to Aishwarya Kulkarni, gen...@soe.ucsc.edu

Hi Aishwarya,

Thank you for using the UCSC Genome Browser and your question about identifying conserved binding sites of a protein between humans and rats.

Rather than the net files, you should likely look at the chain link table, hg19.rn4.all.chain.gz, which would identify the corresponding sequence in the other species. Here is a link with information about the chain format, http://genome.ucsc.edu/goldenPath/help/chain.html, and for comparison the page about the net format: http://genome.ucsc.edu/goldenPath/help/net.html

The chain format is deliberately quite dense where an initial header line starts a section followed by alignment data lines which represent the size of ungapped alignments and difference between block sizes. Here are some example lines from hg19.rn4.all.chain.gz:

chain 578869426 chr14 107349540 + 25151529 106332175 chr6 147636619 + 64051113 138454126 1
17 2 0
57 0 1
21 2 0
4 4 0
...

You could also examine the reciprocal best file for a more filtered answer. Ordinary liftOver chains are not necessarily equivalent in both directions, while the filtered reciprocal best chains are generated for that purpose. A reciprocal best chain exists for rn6:http://hgdownload.soe.ucsc.edu/goldenPath/hg19/vsRn6/reciprocalBest/ but not for rn4. Here are some example lines from hg19.rn6.rbest.chain.gz (note new rn6 chr6 coordinates mapping from the same hg19 region on chr14):

chain 567867938 chr14 107349540 + 25151529 106332175 chr6 147991367 + 64895805 138251111 1
17 2 0
57 0 1
21 2 0
...

Please know we have mailing list archives you can search to find answers that might be of interest. For example some previous answers point users on how to build reciprocal best chains: https://groups.google.com/a/soe.ucsc.edu/forum/?hl=en&fromgroups#!searchin/genome/reciprocal$20best

Thank you again for your inquiry and using the UCSC Genome Browser. If you have any further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.

All the best,

Brian Lee
UCSC Genomics Institute


--


aishwarya kulkarni

unread,
May 3, 2016, 12:01:13 PM5/3/16
to Brian Lee, gen...@soe.ucsc.edu
Hi Brian,
How are the rbest files to be used after they are generated on the coordinates that I am interested in?

Aishwarya.S.Kulkarni

Christopher Lee

unread,
May 5, 2016, 7:06:58 PM5/5/16
to aishwarya kulkarni, Brian Lee, gen...@soe.ucsc.edu

Hi Aishwarya,

Thank you for your question about how to use the reciprocal best chains file.

Questions about how to process data fall outside the scope of this mailing list,
thus how you choose to use the rbest file is up to you.

We recommend Biostars: https://www.biostars.org/
to get help with more general questions like this one.

Thank you again for your inquiry and using the UCSC Genome Browser. If you
have any further questions, please reply to gen...@soe.ucsc.edu. All messages
sent to that address are archived on a publicly-accessible forum. If your
question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.

--

---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser discussion list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.

aishwarya kulkarni

unread,
May 23, 2016, 12:32:48 PM5/23/16
to Christopher Lee, Brian Lee, gen...@soe.ucsc.edu
Thanks Brian
I used following command to find the orthologs between humans and rats using rn6 rbest chains:
liftOver hg19_coordinates hg19.rn6.rbest.chain.gz  rat_coordinates unmapped_file
Is that the right approach to use?

Aishwarya.S.Kulkarni

Christopher Lee

unread,
May 25, 2016, 2:11:44 PM5/25/16
to aishwarya kulkarni, Brian Lee, gen...@soe.ucsc.edu

Hi Aishwarya,

Thank you for your question about the correct command to find orthologs between
rat and humans. Your command is indeed correct. If you ever have questions about
what a Kent utility does, run the command with no arguments, like so:

$ liftOver

This will print a usage message explaining what the program does and its various
arguments and options.

Note that liftOver was designed to work between two different assemblies of the same
species, not across species, as in this case. Thus, although your command may
technically work, it may produce undesired results.

Thank you again for your inquiry and using the UCSC Genome Browser. If you have any
further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that
address are archived on a publicly-accessible forum. If your question includes sensitive
data, you may send it instead to genom...@soe.ucsc.edu.

aishwarya kulkarni

unread,
May 25, 2016, 4:52:33 PM5/25/16
to Christopher Lee, Brian Lee, gen...@soe.ucsc.edu
Thanks Chris, is there a substitute for liftOver that I can use from the available utilities in this case? Also is there a way to verify orthologs in genome browser?
Thanks

Aishwarya.S.Kulkarni

Matthew Speir

unread,
May 27, 2016, 12:06:06 PM5/27/16
to aishwarya kulkarni, Christopher Lee, Brian Lee, gen...@soe.ucsc.edu
Hello Aishwarya,

Thank you for your question.

We do not have any utilities or tools specifically designed for identifying and verifying orthologs. As my colleague Chris stated, the liftOver command you proposed should work, and will likely give you results, but we can't really guarantee that the results will be exactly what you are looking for. You will need to verify the results yourself, and you will have to determine how to best verify them.

I would recommend doing some research on techniques other scientists have used to identify and verify orthologs between two organisms. I would search our mailing list archives to see what questions others have asked about orthologs and what recommendations they had for finding them: https://groups.google.com/a/soe.ucsc.edu/forum/#!searchin/genome/orthologs. And, I would ask your questions about identifying orthologs on the BioStars (https://www.biostars.org/) or SeqAnswers (http://seqanswers.com/) forums. Both of those forums are intended to help people with more general research questions and may be able to provide some advice on your experimental design.

I hope this is helpful. If you have any further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.

Matthew Speir
UCSC Genome Bioinformatics Group
--


Reply all
Reply to author
Forward
0 new messages