It is a little confusing, but it is expected that some variants that fail hard filters will end up as PASS after SVM filtering. The others will be relabeled as failing SVM.
Let me try to explain.
GotCloud has a 2-step filtering approach.
First uses the "hard" filters to identify reads that fail a set of thresholds. Since it is difficult to calibrate these thresholds to best identify "failed" reads, we follow up the hard filter step with SVM filtering.
The SVM filter uses the sites that fail multiple hard filters as false positives and combines those with some external information for positive examples to train itself to best identify which sites should be marked as PASS and which sites fail.
So the idea is that just because a site fails a single hard filter that is difficult to perfectly calibrate, it may not actually be a failure. The SVM filter remarks the sites as either pass or as failing SVM based on all of the inputs it receives (from both external reference files and from the hard filtering information).
Does that help clarify how GotCloud works and why some of the hard filtered sites appear as PASS?
As for your situation, are all of the hard filtered sites in the PASS file or just some of them? They would be relabeled as failing SVM rather than as failing the hard filters they failed in the hard filter step.
If you have further questions, please let me know.
Mary Kate Wing