VCFEVal cannot found sample in multi-sample VCF

51 views
Skip to first unread message

Kübra Narcı

unread,
Dec 30, 2020, 3:39:18 AM12/30/20
to RTG Users
Hi, I am comparing a series of samples in a multi sample VCF with a single sample containing VCFs. I can compare every sample in the multi-sample VCF with its companient (same sample name), but not the last sample (in the header). I am sure the sample is inside the VCF file and I am sure that I am comparing true single VCF (same sample name). 
Moreover, I reproduce this error with trying different ordered samples and issue is the same VCEval cannot found the last sample of multi-sample VCF. 

 Error: Sample "NA18874" not found in calls VCF.

Best,
Kübra Narcı

Sean Irvine

unread,
Dec 30, 2020, 3:56:47 AM12/30/20
to Kübra Narcı, RTG Users
Hi Kübra,

I cannot be sure of your problem without more information, but it sounds like your calls VCF file is not correctly formatted.  How are you making that file?

You can try,

rtg vcfstats --sample NA18874 calls.vcf.gz

to check whether the specified sample is in the VCF file.

Also, you can try,

zgrep -m1 '^#CHROM' calls.vcf.gz

and check that the sample name appears in the list.

Otherwise, please post the exact command lines you are running, and if possible the header lines from the calls and baseline VCF files.

Hope this helps,
Sean.





--
You received this message because you are subscribed to the Google Groups "RTG Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to rtg-users+...@realtimegenomics.com.
To view this discussion on the web visit https://groups.google.com/a/realtimegenomics.com/d/msgid/rtg-users/ff6a289f-8458-4c99-867e-9f298b1cb0a9n%40realtimegenomics.com.

Kübra Narcı

unread,
Dec 30, 2020, 6:30:36 AM12/30/20
to Sean Irvine, RTG Users
I have also regenerated the error using two methods; 
1. I got a subset of samples from a big VCF file  (bcftools view with a sample list)
2. using bcftools merge to merge a series of VCFs

rtg vcfstats runs correctly with the sample.

Note: I performed by comparison successfully by simply subsampling the problematic sample and re-running vcfeval with it. 

Thanks 
Kübra Narcı

PhD Candidate
Department of Health Informatics 
Middle East Technical University 



Len Trigg

unread,
Jan 5, 2021, 4:07:24 PM1/5/21
to Kübra Narcı, Sean Irvine, RTG Users
Hi Kübra,

Can you please attach a small baseline VCF and call VCF so we can replicate your error message here? (The VCF files probably only need to contain a single variant). Please also paste the exact command line that shows the problem with your example VCF files.

Cheers,
Len.



Reply all
Reply to author
Forward
0 new messages