Best way to download hotspot mutations for TCGA samples?

146 views
Skip to first unread message

Ruben Drews

unread,
Nov 6, 2023, 2:09:05 PM11/6/23
to cbiop...@googlegroups.com

Hi,

 

I am trying to get the mutation information of the cancer hotspot genes for the TCGA cohort. In detail, I want to know which TCGA sample has which mutation in any of the hotspot genes?

 

Downloading the whole dataset via the “Data Sets” tabs doesn’t work as the column “hotspots” only contains zeros (even for positive hits like BRAF V600E). This is also the case for multiple versions of the data (Pancancer, Firehose, single paper source). The only way I identified to get the data I want is by manually querying the hotspot genes and then download the Mutations file from the Download tab. Is this the best way or is there a more automated (and even a programmatic) way of doing so?

 

Thank you and best wishes,

Ruben

GSK monitors email communications sent to and from GSK in order to protect GSK, our employees, customers, suppliers and business partners, from cyber threats and loss of GSK Information. GSK monitoring is conducted with appropriate confidentiality controls and in accordance with local laws and after appropriate consultation.




This e-mail was sent by GlaxoSmithKline Services Unlimited
(registered in England and Wales No. 1047315), which is a
member of the GlaxoSmithKline group of companies. The
registered address of GlaxoSmithKline Services Unlimited
is 980 Great West Road, Brentford, Middlesex TW8 9GS.

Benjamin Gross

unread,
Nov 8, 2023, 11:40:10 AM11/8/23
to Ruben Drews, cbiop...@googlegroups.com, de Bruijn, Ino
Hi Ruben,

The cBioPortal annotates variants on the fly using our Genome Nexus system (genomenexus.org).  I would suggest you write a program to read in a MAF of interest and for each variant make a call to the genome nexus annotation service to grab hotspot annotations:


Each of these endpoints will allow you to specify hotspot annotation in the return.  This one is probably best suited for the MAF


Let us know if you have any followup questions.

Best,
Benjamin

-- 
You received this message because you are subscribed to the Google Groups "cBioPortal for Cancer Genomics Discussion Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cbioportal+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/cbioportal/VI1P137MB01571247675A334812743CB4BEAAA%40VI1P137MB0157.EURP137.PROD.OUTLOOK.COM.

Ruben Drews

unread,
Nov 9, 2023, 10:29:51 AM11/9/23
to Benjamin Gross, cbiop...@googlegroups.com, de Bruijn, Ino

Hi Benjamin,

 

That is good to know. Thank you for pointing me towards Genome Nexus and their APIs!

It looks very close to what I had imagined.

 

Best,

Ruben

 

From: Benjamin Gross <benjami...@gmail.com>
Date: Wednesday, 8 November 2023 at 16:42
To: Ruben Drews <ruben....@gsk.com>
Cc: cbiop...@googlegroups.com <cbiop...@googlegroups.com>, de Bruijn, Ino <debr...@mskcc.org>
Subject: Re: [cbioportal] Best way to download hotspot mutations for TCGA samples?

Hi Ruben, The cBioPortal annotates variants on the fly using our Genome Nexus system (genomenexus. org). I would suggest you write a program to read in a MAF of interest and for each variant make a call to the genome nexus annotation service

Reply all
Reply to author
Forward
0 new messages