Hi,
Sorry my explanation was pretty bad. Here are some examples.
Example 1
Pos: 1234: Reference Base: 1000, 1000 (Forward, Reverse)
Pos: 1234: Variant Base: 100, 10 (Forward, Reverse)
In this example, the variant base is clearly biased, there is a clear strand bias, so it would fail the test.
Example 2:
Pos: 4321: Reference Base: 1000, 200 (Forward, Reverse)
Pos: 4321: Variant Base: 100,100 (Forward, Reverse)
In this example, the variant base is clearly not biased - it has equal number of reads in the forward and reverse direction. However, this will fail the strand bias test, even though it isn't strand biased. It seems that the test compares the bias in the variant base, to the bias in the reference base, and if they are significantly different it will fail. For example, if I take Example 2, and make the variant base more biased, it will now pass the strand bias test.
Example 3:
Pos: 4321: Reference Base: 1000, 200 (Forward, Reverse)
Pos: 4321: Variant Base: 100,20 (Forward, Reverse)
The variant will now pass the strand bias test, even though it is more strand biased than example 2.
We were just trying to get our head around what was going on, as he got a high frequency variant, that didn't appear to be strand biased failing the strand biased test (but it passed in other tools like LoFreq etc). And it seems it was failing because the reference base was biased, and as the variant was not it failed.
Does this explanation make sense? And is this just a side affect of using the fischers test.