Dear BEAST2/StarBEAST3 developers,
I am running a StarBEAST3 (v1.2.1) uncalibrated Stage I guide tree in BEAST 2.7.7 with CoupledMCMC v1.2.2 as part of a DELINEATE species delimitation pipeline. The guide tree is intentionally uncalibrated because only the topology is needed for the downstream BPP analysis. I am writing to request validation of our final configuration after an extended convergence debugging process.
Dataset:
Stepping-stone sampling result: UCLN relaxed clock was favored over strict clock across loci. We therefore cannot discard UCLN entirely.
Problem history:
Initially, all 13 loci used UCLN clocks with default priors. TreeHeight differed by up to 12 orders of magnitude between independent replicates, the posterior showed catastrophic excursions to extreme values, and the ESS on TreeHeight remained at 3-5 after 450M states.
Attempted fixes that failed:
Root cause identified: The UCLN standard deviation parameter (ucldStdev) provides a mathematical escape hatch. The median of a mean-1 lognormal is exp(-ucldStdev²/2), so when ucldMean is fixed to 1.0 but ucldStdev is free, the MCMC drives ucldStdev to ~4.0, which drops the median branch rate to exp(-8) ≈ 0.00033 and inflates TreeHeight by ~3000× while maintaining the same genetic distance. The coalescent likelihood reward for an inflated species tree overwhelms the prior penalty.
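As a quick sanity check on the arithmetic above, here is a minimal Python sketch. It assumes the branch-rate distribution is a lognormal whose real-space mean is fixed to 1.0, so the log-space location is -sigma²/2. It also treats 1/median as a rough proxy for how far TreeHeight can stretch at constant genetic distance. The function names are illustrative only and are not part of BEAST 2 or StarBEAST3.

```python
import math


def lognormal_median_rate(sigma, mean=1.0):
    """Median of a lognormal with real-space mean `mean` and log-space s.d. `sigma`.

    For a lognormal, mean = exp(mu + sigma^2 / 2), so fixing the mean gives
    mu = log(mean) - sigma^2 / 2, and the median is exp(mu).
    """
    mu = math.log(mean) - 0.5 * sigma ** 2
    return math.exp(mu)


def height_inflation(sigma):
    """Rough proxy (an assumption): heights scale as 1 / median rate."""
    return 1.0 / lognormal_median_rate(sigma)


if __name__ == "__main__":
    # Compare the unbounded case (~4.0) with the candidate upper bounds.
    for sigma in (0.5, 1.0, 4.0):
        median = lognormal_median_rate(sigma)
        print(f"ucldStdev={sigma}: median rate={median:.5f}, "
              f"inflation={height_inflation(sigma):.1f}x")
```

Under these assumptions, ucldStdev = 4.0 reproduces the exp(-8) median and the roughly 3000× inflation, whereas capping ucldStdev at 1.0 or 0.5 keeps the median rate near 0.61 or 0.88 respectively.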
Final configuration (seeking validation):
Questions:
Is strict clock on COI + UCLN on 12 nuclear loci with ucldStdev ∈ [0,1] the correct approach for an uncalibrated StarBEAST3 guide tree? Is there a more standard solution we should have used?
Should the species-tree clock rate be explicitly fixed anywhere in the XML, or is the absence of a species-tree clock rate parameter sufficient?
Is the ucldStdev upper bound of 1.0 appropriate, or should it be tighter (0.5) or removed in favor of a tighter prior alone?
Are posterior calculation corrections (~1 per 1.5-3M states, magnitudes <200) from BEAGLE dynamic scaling on Apple M4 Max expected at this dataset scale, or do they indicate a remaining structural problem?
We have also consulted the Taming the BEAST StarBEAST3 tutorial and the StarBEAST3 GitHub documentation throughout this process.
Thank you for any guidance.
Jesse Barrington Masters Candidate, Computational Systematics CCNY
Hello again!
Following up on my previous post about uncalibrated StarBEAST3 UCLN guide tree convergence issues.
After resolving numerical instability (strict clock on COI anchor locus, beagle_scaling always, clockRate lower bound 0.001 on nuclear loci), the runs are now numerically clean: zero posterior corrections, negative log-likelihoods, and TreeHeight in units of substitutions per site.
However, topology convergence is failing. After 200M states across 3 replicates:
Dataset: 36 species tree tips, 13 loci (1 strict clock COI + 12 UCLN nuclear AHE), 165 specimens in gene trees, Apple M4 Max, BEAST 2.7.7, StarBEAST3 v1.2.1, CoupledMCMC v1.2.2.
Questions: