MDS IWG 2022 dataset - ask for a data dictionary

31 views
Skip to first unread message

An-Ting Jhuang

unread,
Sep 4, 2025, 10:38:21 AMSep 4
to cbiop...@googlegroups.com
Hi,

I'm a data scientist in RefinedScience and doing MDS-related research. I recently found your great data source and have been exploring the MDS IWG 2022 dataset. Is there a data dictionary? I explored FAQs and the google slide but couldn't find it there.

I'm asking because I find there are 397 TP53 mutated patients from the dashboard, but it doesn't match the count of non-normal values in the column "Chromosomal status at TP53." I assume the count of non-normal (cnloh, del, gain, iso17q) is 397, but it's 189. Wondering if I've missed anything.

     

Please let me know if you need further information from me.

Thank you!
An-Ting Jhuang

An-Ting Jhuang

unread,
Sep 18, 2025, 9:22:25 AM (8 days ago) Sep 18
to cbiop...@googlegroups.com
Hello,

Hope you've had a nice weekend. I'm writing to follow up an email I sent out earlier. Any feedback would be greatly appreciated. Also, please don't hesitate to reach out if you need any clairfications.

Thank you,
An-Ting

From: An-Ting Jhuang
Sent: Thursday, September 4, 2025 9:37 AM
To: cbiop...@googlegroups.com <cbiop...@googlegroups.com>
Subject: MDS IWG 2022 dataset - ask for a data dictionary
 

Guizela Huelsz Prince

unread,
Sep 22, 2025, 5:48:35 AM (4 days ago) Sep 22
to Guizela Huelsz Prince, anting...@refinedscience.com, cbiop...@googlegroups.com
Hi An-Ting Jhuang,

The 397 samples refer to small TP53 mutations such as SNVs and INDELs, while the column "Chromosomal status at TP53" appears to refer to larger-scale chromosomal changes. Therefore, some patients might have a small mutation in TP53, but no chromosomal abnormalities. Please have a look at this publication for further details: https://pmc.ncbi.nlm.nih.gov/articles/PMC8381722/

Best,
Guizela
Reply all
Reply to author
Forward
0 new messages