GDC-TCGA/Xena Browser Data Differences

Ashwin Mukund

Sep 19, 2023, 11:02:53 AM
to UCSC Xena and Cancer Genomics Browser
I was looking at the GDC TCGA-LUSC HTSeq RNA gene expression data (FPKM) in the Xena browser, and I am trying to match the expression values there to the data on the GDC TCGA-LUSC page. I was able to match some of the sample IDs to case IDs on the TCGA-LUSC page and download the STAR counts files containing the Gene Expression Quantification information. Xena states that the FPKM values were log2 transformed, but when I apply this transformation to a gene's value in the STAR counts file, the result does not match the value in Xena. Can someone confirm whether I am looking in the correct place? Or are there additional data transformations in the Xena file that I need to apply to match the data?
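
For reference, the comparison described above can be sketched roughly as follows. This is only an illustration under stated assumptions: it assumes the Xena values are log2(FPKM + 1) (a pseudocount of 1, which is not confirmed in this thread), and the function names are hypothetical helpers, not part of any Xena or GDC tool.

```python
import math


def xena_log2(fpkm: float, pseudocount: float = 1.0) -> float:
    """Transform a raw FPKM value the way Xena is ASSUMED to: log2(fpkm + pseudocount)."""
    return math.log2(fpkm + pseudocount)


def xena_to_fpkm(value: float, pseudocount: float = 1.0) -> float:
    """Invert the assumed transform to recover a raw FPKM value from a Xena value."""
    return 2.0 ** value - pseudocount


def matches(gdc_fpkm: float, xena_value: float, tol: float = 1e-3) -> bool:
    """Check whether a GDC FPKM value agrees with a Xena value after the assumed transform."""
    return math.isclose(xena_log2(gdc_fpkm), xena_value, abs_tol=tol)


if __name__ == "__main__":
    # Made-up numbers for illustration only, not real TCGA-LUSC values.
    print(xena_log2(3.0))
    print(matches(3.0, 2.0))
```

If values still disagree after a check like this, the difference is probably not in the transformation itself.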

Mary Goldman

Sep 19, 2023, 6:00:37 PM
to Ashwin Mukund, UCSC Xena and Cancer Genomics Browser
Hi Ashwin,

Our GDC data was last updated in 2019, which is why you are seeing a discrepancy between our data and the most current data from the GDC. We are currently working to update our GDC data.

Best,
Mary
-----
Mary Goldman (she/her), Design and Outreach Engineer 


Agustina Angeloni

Feb 9, 2024, 10:33:41 AM
to UCSC Xena and Cancer Genomics Browser
Is this also the reason that the methylation beta values in the Xena browser differ from the beta values in the .txt files from the GDC app? As I understand it, the SeSAMe package came after the publication of the Xena data... Thank you

Mary Goldman

Feb 9, 2024, 3:12:04 PM
to Agustina Angeloni, UCSC Xena and Cancer Genomics Browser
Hi Agustina,

Yes, exactly. The files that we host are from 2019 and underwent different harmonization and processing than the files currently on the GDC.

Best,
Mary