I am a first time user of RTG tools. Using version RTG Tools 3.6.2.
I am trying to evaluate annotations that go into determining VQSLOD for VQSR analysis. I have a multi-sample VCF for ~10,000 WES samples. I am trying to implement a workflow discussed by GATK to plot ROC curves for annotations (QUAL, QD, FS, etc.) in their most recent workshop. They've used a single sample to demonstrate the efficacy of VQSR filtering and how it compares to hard-filtered or unfiltered callsets. While I was trying to implement the workflow on my multi-sample VCF, I got an error message " Error: No sample name provided but calls is a multi-sample VCF."
Can I use vcfeval to analyze more than one sample at a time using --sample=<calls_SampleName>? (I have an underscore "_" in my sample names, is this going to be an issue, if RTG uses it as a delimiter?)
If analysis of more than one sample is permitted, do I have to explicitly provide a complete list of samples that I wish to analyze using --sample=<calls_sampleName>
My interest lies in specifically looking at each of the annotations (QD, QUAL, FS and others) in the "INFO" column of my VCF file and plot ROC curves for SNPs and INDELs.
Could you please provide recommendations on how best to get vcfeval to work on my multi-sample VCF?
Thanks,
Joseph.