Hi Naomi,
Peter is right - it’s the first two or three rows in each plate that matter most if sending to Diversity Arrays, and it should represent the breadth of diversity.
There isn’t a simple equation you can use, unfortunately. This is because the amount of diversity you are sending varies from others. If you have different species, then equal representation of both species would be best. If you’re just including a few individuals of a second species as an outgroup, then put those individuals up front. If you are looking across a widespread species, you would try to balance across the range. If you don’t know this yet because that’s the question you’re trying to answer, then representing your suspected distinct groups equally is the best option.
And remember that the more divergent your groups are in a single SNP calling exercise, the fewer SNPs you’ll get. This is true for all methods of ddRAD/dartSeq because variable sites become polyallelic and/or the enzyme binding size evolve/fail and you get reduced call rate for loci across groups. You can also get weird biases in your data. This is just an issue around who you include in each set of SNPs, not an issue with the raw sequence data.
When this happens you can improve your power/SNP number by calling SNPs on individuals specific to your question. This can be done by either taking Diversity Arrays up on their offer for more than one set of SNPs from your order, or calling SNPs yourself using Stacks or similar from the raw data. You don’t need to do different orders (the raw data is good), you just need to remember that one set of SNPs can’t answer all questions because of the biases inherent in biallelic data.
Cheers,
Renee
> To unsubscribe from this group and stop receiving emails from it, send an email to
dartr+un...@googlegroups.com.
> To view this discussion visit
https://groups.google.com/d/msgid/dartr/fd14d870-5b58-41d4-a123-993d90dc42fc%40unmack.net.