Hi Daniel,
Thanks for the information. It's great to see the Genome Browser team is working on the next version of TransMap. Given that the data is pre-release would there be any way for our team to gain access to the non-quality verified version of these tracks in the form of a bigPsl? Since we need these tracks primarily to determine if our current method for deriving an equivalent set of isoforms between species is introducing any biases, non quality-verified data should serve our purposes!
If this is not possible I completely understand and appreciate your response regardless!
________________________________
From: Daniel Schmelter <
dsch...@ucsc.edu>
Sent: Friday, May 7, 2021 8:15:00 PM
To: Castro-Pacheco, Nathan
Cc:
gen...@soe.ucsc.edu
Subject: Re: [genome] Interested in using transMap V5 for our Great Ape Study (panTro6, gorGor5, hg38)
Hello Nathan,
Thanks for emailing the Genome Browser team and for your patience with this response.
We are working on updating TransMap tracks still, and we have some pre-release data that may suit your purposes. These tracks are on TransMap v6 and they have additional information that could include your species. Since this is pre-release data, it has not been quality-verified and may have an incomplete or not-updated track description page. The process to create the data should generally be valid.
https://genome-preview.ucsc.edu/cgi-bin/hgTrackUi?db=gorGor5&g=transMapV6<
https://urldefense.proofpoint.com/v2/url?u=https-3A__genome-2Dpreview.ucsc.edu_cgi-2Dbin_hgTrackUi-3Fdb-3DgorGor5-26g-3DtransMapV6&d=DwMFaQ&c=mkpgQs82XaCKIwNV8b32dmVOmERqJe4bBOtF0CetP9Y&r=WN1C9rezXLyD-Cev1xw3-A&m=pN6qEXsitc2PdIRFBUI1V7w3z4fFlXtX2oZzHGRAdx8&s=5rcgNMBdtqlp3zyNI1Y1Lrnuc0zLTaWOC2O8tWIT0JM&e=>
https://genome-preview.ucsc.edu/cgi-bin/hgTrackUi?db=panTro6&g=transMapV6<
https://urldefense.proofpoint.com/v2/url?u=https-3A__genome-2Dpreview.ucsc.edu_cgi-2Dbin_hgTrackUi-3Fdb-3DpanTro6-26g-3DtransMapV6&d=DwMFaQ&c=mkpgQs82XaCKIwNV8b32dmVOmERqJe4bBOtF0CetP9Y&r=WN1C9rezXLyD-Cev1xw3-A&m=pN6qEXsitc2PdIRFBUI1V7w3z4fFlXtX2oZzHGRAdx8&s=-MLHiXjHYxrEs1daS8mE0gZ3XV3i8GGhNwMVAT5P6dw&e=>
You are probably already aware of our offerings on hg38. There is no pre-release TransMap data for this assembly, only the V5.
https://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&g=transMapV5<
https://urldefense.proofpoint.com/v2/url?u=https-3A__genome.ucsc.edu_cgi-2Dbin_hgTrackUi-3Fdb-3Dhg38-26g-3DtransMapV5&d=DwMFaQ&c=mkpgQs82XaCKIwNV8b32dmVOmERqJe4bBOtF0CetP9Y&r=WN1C9rezXLyD-Cev1xw3-A&m=pN6qEXsitc2PdIRFBUI1V7w3z4fFlXtX2oZzHGRAdx8&s=Y9VGIXENgUz2hxkDtykabea6ybIlZ81GqzfWw60y9fk&e=>
As for when this, or similar data will be released, we cannot give a timeline. It is a priority to get to this at some point. I hope this was helpful.
For further communication, please reply-all to
gen...@soe.ucsc.edu<mailto:
gen...@soe.ucsc.edu>. Those emails are archived in a public help forum. For private questions, you may send emails instead to
genom...@soe.ucsc.edu<mailto:
genom...@soe.ucsc.edu>.
All the best,
Daniel Schmelter
UCSC Genome Browser
On Mon, Apr 26, 2021 at 9:53 AM Castro-Pacheco, Nathan <
cas...@cshl.edu<mailto:
cas...@cshl.edu>> wrote:
Hi there UCSC Genome Institute Representative,
I'm a researcher at Cold Spring Harbor Laboratory in Long Island NY interested in deriving a common set of transcripts (with Ensembl IDs) between panTro6, gorGor5, and hg38 to aid in a Great Ape study we are working on.
Initially, we converted panTro6.ensembl.transMapV5.bigPsl & gorGor5.ensembl.transMapV5.bigPsl to psl format using bigPslToPsl and only kept alignments in each where qName (column 10) began with hg38 meaning the alignment was to a human transcript.
After extracting aligned human transcripts for both panTro6 and gorGor5 we found the intersection of these Transcript IDs thereby giving us a common set between all three species.
We wanted to use the same approach for panTro6 and gorGor5 of extracting transcripts from alignments to a target species but this would require that:
- For panTro6 derivation: Both gorGor5.ensembl.transMapV5.bigPsl & hg38.ensembl.transMapV5.bigPsl contain alignments to panTro6
- For gorGor5 based derivation: Both panTro6.ensembl.transMapV5.bigPsl & hg38.ensembl.transMapV5.bigPsl contain alignments to gorGor5
However, neither of these cases are met as both panTro6 and gorGor5 only contain alignments to mm10 and hg38 and hg38 does not contain alignments to either panTro6 or gorGor5.
With that said, we were wondering if your team at UCSC would be interested in deriving these alignments between transcripts of the three species of interest or sharing the pipeline used to derived TransMap files in order to allow us to do this ourselves. Either way, we'd be happy to include UCSC Genome Institute in our acknowledgments of any publications we produce using TransMap V5 data.
Hopefully, we'll be able to derive these alignments in some way or another
Thanks,
Nathan Castro Pacheco<
https://www.linkedin.com/in/nathan-castro-pacheco/<
https://urldefense.proofpoint.com/v2/url?u=https-3A__www.linkedin.com_in_nathan-2Dcastro-2Dpacheco_&d=DwMFaQ&c=mkpgQs82XaCKIwNV8b32dmVOmERqJe4bBOtF0CetP9Y&r=WN1C9rezXLyD-Cev1xw3-A&m=pN6qEXsitc2PdIRFBUI1V7w3z4fFlXtX2oZzHGRAdx8&s=Nqs4BC9zCj0mPxWzBXOE7-BUiZAgex-KT36TU0dUdlM&e=>>
To unsubscribe from this group and stop receiving emails from it, send an email to
genome+un...@soe.ucsc.edu<mailto:
genome%2Bunsu...@soe.ucsc.edu>.
To view this discussion on the web visit
https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/968bee162e3f4e398189f70552ec6b97%40cshl.edu<
https://urldefense.proofpoint.com/v2/url?u=https-3A__groups.google.com_a_soe.ucsc.edu_d_msgid_genome_968bee162e3f4e398189f70552ec6b97-2540cshl.edu&d=DwMFaQ&c=mkpgQs82XaCKIwNV8b32dmVOmERqJe4bBOtF0CetP9Y&r=WN1C9rezXLyD-Cev1xw3-A&m=pN6qEXsitc2PdIRFBUI1V7w3z4fFlXtX2oZzHGRAdx8&s=VsFzJPzLH9ysNQvsmgWqTvTR2fmtxS0bh2VdqsbjoXU&e=>.