Interesting questions. I've CC'd Simon Ho on this, because he probably has the most sensible things to say about the first question.
On Monday, 31 December 2012 17:39:05 UTC+13, Kai He wrote:
Dear Rob and everyone,
I have a couple of questions.
When analyzing a multi-locus data using BEAST, I was suggested to give each gene an independent clock model by one of the reviewers, However, I wonder is it still compatible with the partition strategy found by Partitionfinder? Partitionfinder sometime partitions the data set by codon position, like the 1st and 2nd codon positions of gene A and Bin partition 1, 3rd codon position of gene A and B in partition 2.
So the question is should I unlink the clock models to be identical to the site models or should I still unlink the clock model by gene?
There's no particular reason that clock models and site models need to match up. Any combination is possible, the problem is finding a good one (or the best one).
My pragmatic suggestion is to take the partitioning scheme from partitionfinder, and then try out linked and unlinked clocks on that scheme. I would then compare linked and unlinked clocks using Bayes Factors (which you can do in Tracer). Simon Ho and I have a paper which does something similar (although this was before we wrote PartitionFinder) here:
http://www.ncbi.nlm.nih.gov/pubmed/20795783
However, there's something a bit more fundamental here - PartitionFinder is written in a likelihood framework, and BEAST in a Bayesian framework. The two things are quite different, and although a partitioning scheme from PartitionFinder should do a perfectly decent job when used in BEAST, it would be more appropriate to do everything in a Bayesian framework. In part, this is because a truly Bayesian approach to partitioning would be to integrate across all possible partitioning schemes and clock models, but it's also true that the 'best' partitioning scheme in a Bayesian framework might differ from that in a likelihood framework. Until recently, there was no software available to do truly Bayesian partitioning, but this just changed - it's now possible to do it, and it's described in this paper:
http://t.co/Nau6rAQM. A note though, I haven't tried out the implementation of the approach, and I don't know how user-friendly it is. That would be one for the BEAST forums though.
Given that your paper is already in review, and presumably that new BEAST paper didn't exist when you started out, I think it would be reasonable to just compare linked and unlinked clock models (i.e. following the partitioning scheme you already have) with Bayes Factors, and ignore the more rigorous approach here. I just thought I'd post about it in case you or others were interested.
Besides, I wonder if anybody are testing or attempt to test different partition strategy using PS/SS etc.?
Sounds interesting! What are PS and SS?
Cheers,
Rob
Happy new year!
Kai He