Hi Vikram,
Thanks for your prompt response. I used the 'INDFILE' produced from
Structure as an input to Clumpp. I copied and pasted the output from
Clumpp back to the Structure output from the same 'k' level keeping
everything same in the file. Then opened this modified file again in
Structure and obtained the barplots.
The Clumpp run finished in less than 27 seconds, I don't have any
prior experience of running this and not sure if this is the usual/
common run time for everybody else?
The parameters (only the main and modified ones) used in Clumpp are as
follows:
# --------------- Main parameters
---------------------------------------------
DATATYPE 0 # The type of data to be read in.
# 0 = individual data in the file
# specified by INDFILE, 1 = population
# data in the file specified by
# POPFILE.
INDFILE tsb_tetra_k5.indfile # The name of the individual
datafile.
# Required if DATATYPE = 0.
POPFILE # The name of the population datafile.
# Required if DATATYPE = 1.
OUTFILE tsb_tetra_k5_pairmat1.outfile # The average cluster
membership
# coefficients across the permuted runs
# are printed here.
MISCFILE tsb_tetra_k5_pairmat1.miscfile # The parameters used and a
summary of
# the results are printed here.
K 5 # Number of clusters.
C 353 # Number of individuals or populations.
R 2 # Number of runs.
M 1 # Method to be used (1 = FullSearch,
# 2 = Greedy, 3 = LargeKGreedy).
W 0 # Weight by the number of individuals
# in each population as specified in
# the datafile (1 if yes, 0 if no).
S 1 # Pairwise matrix similarity
statistic
# to be used. 1 = G, 2 = G'.
======================================================
From your explanation it looks like the only need for Clumpp
processing is to get statistically significant results but if one only
has to see at gross level, Structure plots are also fine - is it
correct inference?
Regards,
S
On Oct 8, 5:02 pm, Vikram Chhatre <
crypticline...@gmail.com> wrote:
> I meant to say 'statistically *more* correct'. Sorry about the oversight.
>
> V
>
> On Mon, Oct 8, 2012 at 10:55 AM, Vikram Chhatre
>
>
>
>
>
>
>
> <
crypticline...@gmail.com> wrote:
> > S -
>
> > The barplots made from data processed through CLUMPP are statistically
> > correct because CLUMPP performs permutations using various algorithms
> > (of your choice) to match the independent iterations for the chosen
> > optimal value of K, as closely to each other as possible (Phew! long
> > sentence).
>
> > Please read up on the documentation manual and the accompanying paper
> > to get a better understanding.
> >
http://www.stanford.edu/group/rosenberglab/clumpp.html
>
> > If you can tell us exactly how you processed your data with clumpp and
> > post relevant example files here, someone should be able to help you.
>
> > All the best
> > V
>