"PAM50 array": The PAM50 calls for microarrays are available from the Nature 2012 paper associated Supplemental table 1. We obtained the PAM50 calls (based off the Agilent 244K custom gene expression microarrays), as well as ER, HER, PR status calls from the paper, which were used to create the view the PAM50 profile in TCGA breast cancer data. Click here to see the view.
"PAM50 RNAseq": We have also included preliminary PAM50 calls based on the Illumina HiSeq 2000 RNA Sequencing platform from the TCGA Analysis Working Group (AWG). Note that these calls are not final and are subject tochange.The PAM50 calls is based on centroid correlation distance to LumA, LumB, Basal, Her2-E, normal-like centroid and a sample could be nearly equally close to more than one centroid, so such samples can flip sometimes. For example flipping between LumA and LumB or flipping between luminal B and HER2-E happens sometimes.
I download the TCGA Breast Cancer data from the UNC web you posted. In the file "clinical_data.", there are three columns about PAM50 classification results.What are the difference between them?
1) The first one is "PAM50Call", is this based on TCGA Microarray data set?
2) The second one is "PAM50_mRNA_nature2012". Is this the PAM50 results
published in TCGA BRCA nature paper based on Microarray data?
So what is the
difference between "PAM50Call" and "PAM50_mRNA_nature2012"?
3) The third one is "PAM50Call_RNAseq". Is this based on TCGA RNA-seq data.
I also read the original PAM50 paper (JS Parker et al)from your UNC web.
1) The PAM50 predictor looked like the optimized centroids for 5 BRCA subtypes,
whose subtype information are obtained by previous unsupervised cluster
methods. So when we want to cluster a new sample, we only need to compare the
distance between the new sample and the centroids of 5 subtypes (PAM
predictor). Is my understanding right?
2) Can I use the PAM50 predictors download from UNC to predict a new sample
whose data set is by Microarray but different platform/protocol from those
used in Parker's paper?
3) How do you apply the PAM50 predictor on TCGA RNASeq data? Do you use some
RNASeq data samples (whose subtype information is known) to train new PAM50
predictor for RNASeq data set?
Thanks for your kind reply. I am wondering where did you get TCGA breast AWG PAM50 calls based on mRNAseq data? I could not find it on TCGA webpage. Or could you please send me a copy of it? The most recent the best.
Thank you very much!
Jinyu
> To unsubscribe from this group and stop receiving emails from it, send an email to ucsc-cancer-genomics-browser+unsub...@googlegroups.com.
--
Best
Dr. Singh