Hi Jack,
Thanks for reaching out!
I am not super familiar with the intricacies of the different calling methods and filtering for that TCGA MAF file. I do recall that Mutect2 tends to call more events than other callers (or at least used to):
https://bmcmedgenomics.biomedcentral.com/articles/10.1186/s12920-019-0636-y
I imagine that might be one of the reasons, but to get more certainty one would probably need to do a more comprehensive comparison between the MAFs comparing allele frequencies and the differences in types of alterations. Maybe someone else on the mailing list more familiar with this specific file is able to provide a better answer. Glad to hear though that the current GDC MAF is more consistent with what’s in cBioPortal
> Furthermore, do you have any advice on where the best place to obtain updated and accurate mutational info for TCGA-COAD is as this is currently foundational for my PhD project?
I would use the TCGA Pancan data provided here:
https://gdc.cancer.gov/about-data/publications/pancanatlas
You can also use the cBioPortal mutation data, though note that we apply some harmonization (re-annoation thru Genome Nexus), to make the data more suitable for comparison with other mutation data in cBioPortal:
https://github.com/cBioPortal/datahub/tree/master/public/coadread_tcga_pan_can_atlas_2018
Hope that helps!
Best wishes,
Ino
--
You received this message because you are subscribed to the Google Groups "cBioPortal for Cancer Genomics Discussion Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
cbioportal+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/cbioportal/90efedf8-242a-4538-8f15-6c0efb3199fbn%40googlegroups.com.