Hi All,
I have been running *BEAST analyses with fossil calibrations for a data set composed of 134 exemplars representing 45 species (at least two individuals for all but 1 ingroup taxon). The data set includes 9 loci with virtually complete coverage for each exemplar. I have struggled to obtain proper convergence with this data set and have tried all sorts of adjustments in an effort to resolve the problem, but to no avail. I have been running these analyses for 1 billion generations. I am using two calibrations and I have set up analyses with both calibrations (one young, one old), other analyses with just the old or just the young calibration, and some that are not calibrated at all. I am using a uniform prior for the deeper calibration with a range of 24-36 million years, and I have used a lognormal distribution with a mean in real time of 2.5 million years for the younger calibration. The problems that I have encountered include (1) posteriors that decline in value over time, with the shape of the trace closely matching those of the prior and the species.coalescent, and (2) recovered species tree topologies that actually make good sense, accept that the calibrated nodes are highly compressed toward the tips (as if there is almost no divergence among the members of calibrated clade) such that the scale for all other branches on the tree are gigantic. For example, in calibrated analyses in which the older calibration node has a prior distribution with a mean of 30 million years, the chronogram will reflect this age for this node, but the rest of the nodes, which appear much deeper in the tree, necessarily have really old node ages. In this case, the root of the tree will have a divergence estimate of 5 billion years, etc. The bizarre compressed node phenomenon only occurs for nodes that are tied to a calibration. In other analyses, in which the node in question is not used for calibration, the branch lengths within the group are much longer as expected. Also, in an earlier batch of analyses with a slightly less complete version of this data set, the posterior probability trace climbed throughout the analysis, with the branch lengths associated with the calibration nodes starting off compressed and then growing progressively over the course of the analysis. Unfortunately, in those analyses, 500 million generations wasn't enough to allow the tree shape to stabilize.
I have run analyses on each of these permutations of the data set (with different calibrations) with empty alignments. I wasn't able to see anything in these analyses that jumped out at me as an explanation for my bizarre results, but this may simply be an issue of my not knowing what I should be looking for.
I've been wondering if this might have something to do with an interaction between population size estimates and branch lengths, so I have tried running analyses using the 1/x prior for species.pop.mean as we all as an invariant gamma (which solved a colleague's convergence problems). This appears not to have made much difference, although the invgamma analyses are still running.
Any advice would be greatly appreciated!
Jim