My GISTIC amplification/deletion score plot looks much noisier than the one in the previous TCGA marker paper for the same cancer type (clear cell renal carcinoma), which used SNP array data.
Aside from the noisy plot, I cannot find any of the reported amplified/deleted genes in my outputs. The only result close to the TCGA paper is the percentage of samples with arm-level events (5q, 14q).
There are differences between the TCGA study and mine, though:
- the TCGA paper used SNP array data instead of WES data
- the segmentation algorithms are different:
  - TCGA: "Segmented copy number profiles were analyzed using Ziggurat deconstruction [3,5] to determine the most likely set of events contributing to these profiles, and the lengths, amplitudes, and locations of these events."
  - mine: GATK4 CNV
- the TCGA cohort is about 3 times my sample size
I'm not sure whether my output is abnormal, since I haven't found any paper that has used GISTIC2.0 on WES CNV results. Does anyone have experience with this, and can you tell me if anything looks wrong?
Thank you
ps:
I tried to keep the parameters as close as possible to those in the TCGA paper, which are described in their supplement:
Absolute log2 ratios greater than 1.5 were capped to 1.5 to reduce hypersegmentation due to variations in dynamic range between probes, and events whose absolute amplitude was less than a log2 ratio of 0.1 were excluded from further analysis as likely to represent noise. Events whose length was greater than and less than 50% of the chromosome arm on which they resided were called arm-level and focal events, respectively, and these groups of events were analyzed separately using GISTIC 2.0 [5]. Regions were considered significant if assigned False Discovery Rate [6] q-values < 0.25.
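
To make those thresholds concrete, here is a minimal Python sketch of the cap, noise filter, and arm-level/focal split described in the quote. It is only an illustration, not GISTIC's actual implementation (which works on deconstructed events). The `Event` layout and the `classify` function are hypothetical names, and treating an event of exactly 50% of the arm as focal is my assumption, since the quote does not cover that case.

```python
from typing import NamedTuple, Optional, Tuple

CAP = 1.5             # |log2 ratio| values above this are capped
NOISE = 0.1           # events with |log2 ratio| below this are dropped as noise
BROAD_FRACTION = 0.5  # fraction of the arm separating arm-level from focal


class Event(NamedTuple):
    """Hypothetical container for one copy-number event."""
    length: int        # event length in bp
    arm_length: int    # length of the chromosome arm it lies on, in bp
    log2_ratio: float  # event amplitude


def classify(event: Event) -> Optional[Tuple[float, str]]:
    """Return None for noise, else (capped log2 ratio, 'arm-level' or 'focal')."""
    if abs(event.log2_ratio) < NOISE:
        return None
    # Cap the amplitude to the range [-CAP, CAP].
    capped = max(-CAP, min(CAP, event.log2_ratio))
    # Assumption: exactly 50% of the arm counts as focal.
    fraction = event.length / event.arm_length
    kind = "arm-level" if fraction > BROAD_FRACTION else "focal"
    return capped, kind


# Made-up example events (lengths in bp), not real data.
examples = [
    Event(length=2_000_000, arm_length=100_000_000, log2_ratio=2.3),
    Event(length=80_000_000, arm_length=100_000_000, log2_ratio=-0.4),
    Event(length=80_000_000, arm_length=100_000_000, log2_ratio=-0.05),
]
for ev in examples:
    print(ev, "->", classify(ev))
```

With WES-derived segments, the amplitude distribution may differ from SNP arrays, so it could be worth checking how many events fall just above the 0.1 noise threshold before comparing against the TCGA plot.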