Dear Tracy and all,
DPPDIV is really awesome, but I have some questions about its performance on large datsets. I am using DPPDIV to date a tree with over 1000 species and almost 15Kb, and although the program is really fast and the results are in accordance with the expected dates (in tune with recent independent sources), some of the values in the .out file do not reach ESS>200. In fact, some of them are below 100. (I ran with the default configurations in CIPRES: 1 million cycles, gamma shape 2, gamma omrates 4, hyperprior gamma 2, cbd tree prior, and three uniform, soft bounded calibrations at deeper nodes).
I understand that for a dataset this size, it might be too much to ask for all high ESS values, but how is this really an issue, in such cases? I assume that at least the likelihood, root age and priors on calibration the nodes should always be >200 (which is my case), but what about the other (uncalibrated) nodes? Is it more important to have all values greater than 200 or check if multiple runs converge to the same overall dates for the tree (or both, ideally)?
In the paper from Arcila et al 2015 - Mol. Phylo. Evol. - they comment that ESS values for DPPDIV analyses were under 200 for many nodes, but the DPPDIV dates were quite consistent with the results of other programs used in that study. My concern with my data (from my modest experience with large datasets) is that it may not reach 'good' ESS in any other bayesian dating programs, since they were not designed for such scale.
I wondered if opting for the prior to Yule, or adopting minimum exponential calibration priors, could help the ESS values. I am running these alternative analyses at this moment, and as soon as I have the results I will report back. But I really want to hear from you about these ESS issues, they would help not only in these analyses, but in my formation as a whole.
Thanks a lot!!
Cheers!
Alex