Dear UCSC Xena Team,
first of all, a big thank you for making this amazing resource available!
I have a question related to the handling of multiple specimens or samples in the donor-centric cohort of ICGC, specifically for copy number somatic mutation and somatic coding mutations.
As far as I can tell from the "raw" data available from the ICGC data portal, for specific projects, some donors had multiple specimens and/or samples taken which in turn lead to duplicates in the somatic mutation and/or copy number data. I am finding these somewhat tricky to handle since it is not always apparent whether two specimens were taken at the same time or not.
I noticed that you only report donor id in your files in the donor-centric cohort, thus I was wondering how you dealt with these replicates?
Thank you very much in advance and best
David