I am working with large datasets (1000-1500 sequences, each >15,000-20,000 bp) and running Bayesian phylogeographic analyses using BEAST. However, my analyses are running very slowly even with the beagle. 400 million MCMC takes a 3-4weeks (sometimes more than a month)) on our current Linux system.
My current System Specifications are here:
Processor = intel Xeon (R) Gold 6242 CPU @2.80GHz x 32
Graphics= NVIDiA corporation TU102GL [quadro RTx6000/8000] (Quadro RTx 6000)
I am considering two options to optimize performance:
1. Upgrading our system (e.g., adding more CPUs and RAM).
2. Purchasing another system with similar specs and splitting MCMC chains across multiple machines, then combining them using LogCombiner.
For those who have worked with similar large datasets, what setup do you recommend for faster convergence? Are there specific computational strategies that significantly improve speed?
Thank you in advance for your insights
--
You received this message because you are subscribed to the Google Groups "beast-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to beast-users...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/beast-users/9f75f9c9-f539-42a5-90a7-67d8ccd10ea6n%40googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/beast-users/CANT%2BQ3dEW024-8CgrxZDWWzxoiCfGKZUPmOu9JgLUr-sbyPnYQ%40mail.gmail.com.