Combine mcmctree analysis


Adrian Luczon

Apr 29, 2026, 4:53:18 AM
to PAML discussion group
Hello

I have a 1.4-billion-base-pair alignment from which I would like to estimate divergence times. I ran it through mcmctree on an HPC, but it always ran out of memory. I then cut the alignment into smaller chunks of 30 million bp each. I tried the mcmctree analysis on some of these chunks, and it finally worked.

I would like to ask whether there is any way to combine the results from the mcmctree analyses of these different chunks into a single divergence time estimate.

Thank you for the advice.

Adrian

Sishuo Wang

May 5, 2026, 10:09:14 PM
to PAML discussion group
Hi Adrian,

Thanks for the question! May I ask whether you ran into the memory issue only when running on the whole alignment, and got no errors after splitting it into several smaller alignments?

If so, a trivial but always useful solution is to find a computer with more memory.

Alternatively, it would be good to use the approximate likelihood method (usedata = 3). With this approach, you can split your data into a few partitions according to some criterion. Then run CODEML (BASEML for nucleotide data) separately on each partition to get its gradient and Hessian, merge the gradient and Hessian from every partition into a single in.BV file, and run the subsequent mcmctree dating analysis on that file. The trick is to split the whole alignment into partitions, because running CODEML on each "small" partition lowers the memory requirement.
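
As a rough sketch of the merging step only: as far as I know, the combined in.BV is simply the per-partition out.BV blocks placed one after another, in the same order as the partitions appear in the alignment file. The directory layout (part_01/out.BV, part_02/out.BV, ...) and the function name merge_bv_files are made up for illustration, so please adapt them to your own setup.

```python
from pathlib import Path


def merge_bv_files(bv_paths, merged_path):
    """Concatenate per-partition out.BV files into a single in.BV file.

    The blocks are written in the order given, separated by a blank line.
    Returns the number of blocks written.
    """
    blocks = []
    for path in bv_paths:
        text = Path(path).read_text().strip()
        if not text:
            raise ValueError(f"{path} is empty; did the run for this partition finish?")
        blocks.append(text)
    Path(merged_path).write_text("\n\n".join(blocks) + "\n")
    return len(blocks)


if __name__ == "__main__":
    # Hypothetical layout: one directory per partition, each holding an out.BV.
    # sorted() keeps the partitions in a stable order (part_01, part_02, ...).
    partition_files = sorted(Path(".").glob("part_*/out.BV"))
    if partition_files:
        n = merge_bv_files(partition_files, "in.BV")
        print(f"Merged {n} partition blocks into in.BV")
```

For the final dating run, the mcmctree control file would then need usedata = 2 and ndata set to the number of partitions, with the partitions in the alignment file listed in the same order as the blocks in in.BV.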

For more info, you can have a look at the nice tutorial written by the development team.

best,
sishuo
