Dear Dr. Roth,
I am try to generate the tsv input files for PyClone. Because the example tsv in pyclone github seems do not have CNV so I borrow the example tsv from PyClone VI (
https://zenodo.org/record/4268826#.YFIS1ZNKhTY). (I assume the definition is the same in PyClone and PyClone VI)
I am a little confuse about the major_cn in the examples of PyClone VI
mutation_id sample_id ref_counts alt_counts normal_cn major_cn minor_cn tumour_content
CRUK0100:1:1153814:G R1 260 128 2 4 0 0.68
CRUK0100:1:3782573:G R1 82 2 2 4 0 0.68
CRUK0100:3:126227155:G R1 124 32 2 3 0 0.68
My perception is the major_cn and minor_cn value is derive from CNV data and intersect it with the SNV data. The major_cn refer to the cn of major allele (most of time is the ref), and the minor_cn refer to the cn of the minor allele (most of time is the alt).
The CRUK0100:1:3782573:G is easy to understand, this loci has duplication with 4 copy and majority of it is ref, so major_cn is 4.
But the CRUK0100:1:1153814:G is confusing, the loci still has duplication with 4 copy, but the ref:alt ratio seem closer to 2:1, how to get the major_cn 4 and minor_cn 0.
Please correct me.
best wishes
Ming