RE: about data_mrna_seq_v2_rsem

Ren, Danqing

Sep 9, 2025, 8:36:34 AM
to cbiop...@googlegroups.com

Hello Sir or Madam,

May I ask a question:
In the file titled data_mrna_seq_v2_rsem included in the TCGA datasets, what unit is a given gene's mRNA expression reported in: FPKM, FPKM-UQ, or TPM?

How can I identify or clarify this information?
Thanks,

Dannie

Guizela Huelsz Prince

Sep 22, 2025, 6:03:32 AM
to Guizela Huelsz Prince, Danqi...@uth.tmc.edu, cbiop...@googlegroups.com
Hi Dannie,

The data in data_mrna_seq_v2_rsem correspond to normalized counts (rsem.genes.normalized_results) from RSEM. You can find more details here: https://docs.cbioportal.org/user-guide/faq/#how-is-tcga-rnaseqv2-processed-what-units-are-used

Best,
Guizela

Ren, Danqing

Sep 22, 2025, 1:55:24 PM
to Guizela Huelsz Prince, cbiop...@googlegroups.com

Hello Guizela,

Thank you very much for your clarification. I just wanted to confirm one more point. From my understanding, the data_mrna_seq_v2_rsem file in the TCGA dataset corresponds to RSEM-normalized results, which represent FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values, rather than TPM or FPKM-UQ. If so, would it be appropriate to report the mRNA expression values in FPKM units when referring to specific genes?

Thanks again for your helpful response.

Best regards,
Dannie

Guizela Huelsz Prince

Sep 24, 2025, 4:09:17 AM
to Guizela Huelsz Prince, Danqi...@uth.tmc.edu, cbiop...@googlegroups.com
Hi Dannie,

The values in data_mrna_seq_v2_rsem are not FPKM, FPKM-UQ, or TPM. They are RSEM-normalized counts (rsem.genes.normalized_results): raw counts scaled by upper-quartile normalization, i.e. each sample's raw counts are divided by the 75th percentile of that sample's nonzero counts and then multiplied by 1000 (as described here).

Best,
Guizela
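
For readers who want to see the arithmetic, the scaling described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the actual TCGA/RSEM pipeline code: the function names `percentile` and `upper_quartile_normalize`, the gene labels, and the toy counts are all hypothetical, and the percentile here uses simple linear interpolation, which may differ from the method the real pipeline uses.

```python
def percentile(sorted_values, q):
    """Percentile q (0-100) of a pre-sorted list, using linear interpolation.

    Assumption: the real pipeline may compute its quantile differently.
    """
    if not sorted_values:
        raise ValueError("percentile of an empty list is undefined")
    pos = (len(sorted_values) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac


def upper_quartile_normalize(sample_counts, scale=1000.0):
    """Scale one sample's raw counts by the 75th percentile of its nonzero counts.

    sample_counts: dict mapping gene name -> raw count for a single sample.
    Returns a dict mapping gene name -> normalized count.
    """
    nonzero = sorted(c for c in sample_counts.values() if c > 0)
    if not nonzero:
        # No expressed genes: nothing to scale by, so return zeros.
        return {gene: 0.0 for gene in sample_counts}
    q75 = percentile(nonzero, 75)
    return {gene: count / q75 * scale for gene, count in sample_counts.items()}


# Hypothetical toy sample (made-up gene labels and counts).
raw_counts = {"GENE_A": 0, "GENE_B": 100, "GENE_C": 200, "GENE_D": 300, "GENE_E": 400}
normalized = upper_quartile_normalize(raw_counts)

for gene, value in normalized.items():
    print(f"{gene}\t{value:.2f}")
```

Note that gene length never enters the calculation, which is one reason these values cannot be read as FPKM or TPM: after scaling, the 75th percentile of each sample's nonzero values is 1000 by construction.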