MrBayea issue

43 views
Skip to first unread message

YU DENG

unread,
Nov 19, 2024, 11:55:49 AM11/19/24
to CIPRES Science Gateway Users
Hello, PhD,

I am a graduate student in the field of life evolution systems from China. I recently encountered some problems in the MrBayea tree building on CIPRES. The details are as follows:

/home/cipres/ngbw/contrib/tools/bin/mrbayes_3.2.7a: line 15: 1045240 Killed                  ${exe} < $paramfile
slurmstepd: error: Detected 1 oom_kill event in StepId=35221154.0. Some of the step tasks have been OOM Killed.
srun: error: exp-1-04: task 6: Out Of Memory
srun: Terminating StepId=35221154.0
slurmstepd: error: *** STEP 35221154.0 ON exp-1-04 CANCELLED AT 2024-11-19T03:40:16 ***

How can I solve it?

My CIPRES account name is Xiangxingxing, and the problem occurred in the latest project.

Best regards,
Xingxing Xiang

Mark Miller

unread,
Nov 19, 2024, 12:05:02 PM11/19/24
to CIPRES Science Gateway Users
Hi there Xingxing Xiang,
Thanks for reporting the issue. We had to move MrBayes runs to different cluster (Expanse) this week because of a maintenance issue on Popeye.
Unfortunately, Expanse has less memory per core. We are looking into a fix. I will get back to you shortly.
Mark

Mark Miller

unread,
Nov 19, 2024, 3:41:24 PM11/19/24
to CIPRES Science Gateway Users
Hi again,
We discussed this internally, and the best solution we can offer at this moment is to resubmit the job after about 5 PM Pacific time,Tuesday, 11/26.
We will post an update here and on the web site when the Popeye is back in service.
Sorry for the inconvenience.

Mark

YU DENG

unread,
Nov 30, 2024, 9:53:56 AM11/30/24
to CIPRES Science Gateway Users
Hello, Doctor:
I have run BI after 5 PM Pacific time, Tuesday, 11/26, but the following problem still occurs. Why?

/home/cipres/ngbw/contrib/tools/bin/mrbayes_3.2.7a: line 15: 2321074 Killed                  ${exe} < $paramfile
slurmstepd: error: Detected 1 oom_kill event in StepId=35419862.0. Some of the step tasks have been OOM Killed.
srun: error: exp-1-04: task 2: Out Of Memory
srun: Terminating StepId=35419862.0
slurmstepd: error: *** STEP 35419862.0 ON exp-1-04 CANCELLED AT 2024-11-29T10:26:24 ***

Thank you very much for your answer

2024年11月20日水曜日 4:41:24 UTC+8 mmi...@ucsd.edu:

Pfeiffer, Wayne

unread,
Nov 30, 2024, 10:34:59 AM11/30/24
to YU DENG, CIPRES Science Gateway Users, Pfeiffer, Wayne
Hi,

The Popeye computer with more memory is up again, but our programmer who needs to move MrBayes job submissions back to Popeye is away on vacation. He will move jobs to Popeye when he returns on Monday, December 2. We are sorry for the inconvenience.

Wayne

-- 
You received this message because you are subscribed to the Google Groups "CIPRES Science Gateway Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cipres-science-gatew...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/cipres-science-gateway-users/e0dc7666-9cfe-411e-ae52-9efa43e99d04n%40googlegroups.com.

Pfeiffer, Wayne

unread,
Dec 1, 2024, 11:06:25 PM12/1/24
to YU DENG, CIPRES Science Gateway Users, Pfeiffer, Wayne
Hi,

MrBayes jobs are running on Popeye again :)

Wayne
Reply all
Reply to author
Forward
0 new messages