Hi JJ,
Thank you very much for your reply. I think I get the basic idea behind the rppa assays now. Checking the level3 normalized data from
https://bioinformatics.mdanderson.org/public-software/tcpa/ in terms of UCEC the number of samples differ from the cbioportal file, so I assume you used another version?
Regarding z scores, for example in R:
> data_rppa<-fread("~/ucec_tcga_pan_can_atlas_2018/data_rppa.txt", stringsAsFactors = FALSE)
> data_rppa_z<-fread("~/ucec_tcga_pan_can_atlas_2018/data_rppa_Zscores.txt", stringsAsFactors = FALSE)
> data_rppa_df<-setDF(data_rppa)
> data_rppa_z_df<-setDF(data_rppa_z)
> data_rppa_df$means<-rowMeans(data_rppa_df[,2:424], na.rm = TRUE)
> data_rppa_df$sd<-apply(data_rppa_df[,2:424],1, sd, na.rm = TRUE)
#First protein, first sample z score calculation
(data_rppa_df[1,2]-data_rppa_df[1,425])/data_rppa_df[1,426]
[1] 0.2542827
#Second protein, first sample z score calculation
(data_rppa_df[2,2]-data_rppa_df[2,425])/data_rppa_df[2,426]
[1] -1.849435
#And these are the values in the zscore rppa file:
> data_rppa_z_df[1,2]
[1] 0.1918
> data_rppa_z_df[2,2]
[1] -1.8583
Thank you for your time and your attention.
Best,
Oscar