Dear Mr./Mrs.,
Recently, I read though one paper "Identification of a Novel Coregulator, SH3YL1, That Interacts With the Androgen Receptor N-Terminus". I was especially interested in the survival analysis of UBN1 based on TCGA data, which was one of the results in this paper. With the author's help, I successfully repeated their KM plot using cBioPortal for cancer genomics (http://www.cbioportal.org/).
Because they tried to divide all the patients in TCGA for prostate cancer (PRAD) based on high/low UBN1 mRNA expression. Therefore only the mRNA expression data was used for survival analysis. The detailed parameter settings were shown in attached file (the 1st slide).
In TCGA provisional cohort, there are 498 PRAD samples with mRNA expression. But in the disease free survival results (the 3rd slide), there are in total 214 samples (23 over-expressed cases and 191 other cases). And in the overall survival result (the 2nd slide), there are only 215 PRAD samples. Do you know how the other sample in PRAD TCGA data set are filtered out?
I am looking forward to your kind reply. Thanks very much!
Best Regards!
Chen Xin