Hi all,
I'm the author of the Harvester, (
http://taylor0.biology.ucla.edu/
struct_harvest/).
In order to run the Evanno* method in the Harvester you need to have a
continuous series of K values, all with the same number of replicates.
If you skip K values (say run K=2, 3, 5) then the Evanno routine (in
the code) will not be run. If you have different numbers of replicates
(say K=2 has 10 reps, K=3 has 10, K=5 has 11) then the Evanno routine
will not be run. These restrictions come from my interpretation of the
Evanno method.
The Harvester will still pull out all of the summary data from your
results, make a nice plot for you, and create all of your CLUMPP
indfiles, regardless of the number of replicates or the sequentiality
of the K values. The issues I mentioned only affect whether or not the
Evanno routine gets run.
Pro-tip: I saw this discussed somewhere else and I thought it might be
useful to state the details how how I implemented the Evanno routine
in my code. It may seem silly to run K=1, but if you do then you can
get results from the Evanno method for K=2. The method of calculating
delta-K for any value (k) of K uses the next value of K (k+1) and the
*previous* value of K (k-1). Not (k+1) and (k+2) as one might guess
(for verification, check the Figure 2 in the Evanno paper and compare
part A to part D. From their caption:
L'(k) = L(k) - L(k-1) (1)
L''(k) = L'(k+1) - L'(k) (2)
Thus,
L''(k) = [L(k+1) - L(k)] - [L(k) - L(k-1)], substituting (1) into (2).
Note that in part A you have K in [2, 23] and in part D you have K in
[3, 22]. If you were to use (k+1) and (k+2), as I have seen advocated
elsewhere, you would not be able to get a value for delta-K for k=22).
So you might want to run STRUCTURE for at least one value of K less
than you think is reasonable and one value of K greater than you think
is reasonable.
I'd like to thank Alan for writing about the Harvester, and invite
people to email me (my preferred email is on the Harvester webpage)
questions, suggestions, or requests regarding the Harvester.
d
*Evanno et al., 2005. Detecting the number of clusters of individuals
using the software STRUCTURE: a simulation study. Molecular Ecology
14 , 2611 - 2620 <a href="
http://www3.interscience.wiley.com/journal/
118706173/abstract">link</a>