Luka Topalovic
Oct 14, 2023, 11:45:06 PM
to RTG Users
Hi, I tried comparing DeepVariant (long-read sequencing) against GATK HC (short-read sequencing) on the HG002 sample, with HC as the baseline and DeepVariant as the call VCF:
./rtg-tools-3.12.1/RTG.jar vcfeval -b HG002_GRCh38_1_22_v4.2.1_benchmark.vcf.gz -c HG002.m84011_220902_175841_s1.GRCh38.deepvariant.phased.filtered.pass.vcf.gz -t GRCh38.sdf -o WGS_vs_Revio
This is what I got:
Threshold  True-pos-baseline  True-pos-call  False-pos  False-neg  Precision  Sensitivity  F-measure
----------------------------------------------------------------------------------------------------
   29.000            3902625        3903326     294655     145612     0.9298       0.9640     0.9466
     None            4035537        4037948    1202303      12700     0.7706       0.9969     0.8692
I understand that TPB (True-pos-baseline) represents the number of calls in HC after filtering is applied (long indels, GT=0/0, etc.), but how can the number of TPC (True-pos-call) be larger than TPB? I am attaching the log file in case that helps.
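As a sanity check on the columns above, here is a minimal sketch of how the three summary metrics relate to the counts. It assumes precision is computed from True-pos-call and sensitivity from True-pos-baseline. The function name `vcfeval_metrics` is hypothetical and not part of RTG Tools.

```python
def vcfeval_metrics(tp_baseline, tp_call, fp, fn):
    """Recompute the summary metrics from the four count columns.

    Assumption: precision uses True-pos-call, sensitivity uses
    True-pos-baseline, and F-measure is their harmonic mean.
    """
    precision = tp_call / (tp_call + fp)
    sensitivity = tp_baseline / (tp_baseline + fn)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return precision, sensitivity, f_measure


# Counts copied from the two rows of the summary table above:
# (True-pos-baseline, True-pos-call, False-pos, False-neg)
rows = {
    "29.000": (3902625, 3903326, 294655, 145612),
    "None": (4035537, 4037948, 1202303, 12700),
}

for threshold, counts in rows.items():
    p, s, f = vcfeval_metrics(*counts)
    print(f"{threshold:>8}  precision={p:.4f}  sensitivity={s:.4f}  f_measure={f:.4f}")
```

Under these assumptions the recomputed values match the Precision, Sensitivity and F-measure columns to four decimal places, so the two true-positive counts do seem to feed different metrics.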
Thanks for all the help,
Luka