Hi,
I'm testing the tassel 5 GBS v2 pipeline on the sequence data I have for 45 durum wheat samples, mostly cultivars. Durum is a tetraploid wheat by the way. The same data has been analyzed using the originall version of the tassel pipeline. I set the tag length at 100, and I've got 6X more SNP variants called from the v2 pipeline, quite impressive. However, I also got 30% overall rate of hets called using v2 as opposed to <1% using the original version.
I was wondering if the quantitative SNP calling method was implemented in the v2. When -c parameter was increased from 4 to10 in the TagExportToFastqPlugin, that only reduced the amount of variants called, but didn't help to reduce the amount of hets called. The default -eR of 0.01 in the ProductionSNPCaller Plugin V2 was used. Other than these two parameters, I'm not sure what other parameters I could play with.
Thanks for your thoughts.
Shiaoman