[slurm-dev] problem with slurm job step creation


yogendra...@wipro.com

May 28, 2014, 1:05:13 PM
to slurm-dev

Hi,

I am getting the error below with builtin scheduling and GPUs in shared mode (cons_resource, CR_CORE_MEMORY).

The following message appears in the output file:

"srun: Job step creation temporarily disabled, retrying"

Thanks in advance.

--
Regards,
Yogendra Sharma


The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments.

WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email.

www.wipro.com

Marcin Stolarek

May 30, 2014, 5:44:13 AM
to slurm-dev
2014-05-28 19:05 GMT+02:00 <yogendra...@wipro.com>:

"srun: Job step creation temporarily disabled, retrying"

Are you trying this with an interactive job?

I've seen this problem with GRES and interactive jobs; batch jobs were working just fine, but I haven't had time to examine it in depth.

cheers,
marcin

 

--
Marcin Stolarek
Interdisciplinary Centre for Mathematical and Computational Modelling (ICM),
University of Warsaw, Poland

yogendra...@wipro.com

May 30, 2014, 6:29:20 AM
to slurm-dev

[2014-05-28T11:42:06+05:30] error: gres/gpu: step_test 1279.4294967294 gres_bit_alloc is NULL

[2014-05-28T11:42:06+05:30] error: gres/gpu: step_test 1279.4294967294 gres_bit_alloc is NULL

[2014-05-28T11:42:06+05:30] _slurm_rpc_job_step_create for job 1279: Requested nodes are busy

 

The above are the log entries from /var/log/slurm/slurmctld.log. Can anyone please help me with this?
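
One detail in the log excerpt may be worth noting: the step id 4294967294 equals 2^32 - 2 (0xFFFFFFFE). That looks like a reserved placeholder value rather than a real step number, which would suggest the step had not been assigned an id when the GRES check ran. This reading is an assumption and is not confirmed anywhere in this thread. The Python sketch below only extracts the ids from the quoted lines and does the arithmetic; `STEP_RE`, `parse_step_ids`, and `describe` are illustrative names, not part of Slurm.

```python
import re

# Log lines copied from the slurmctld.log excerpt above.
LOG_LINES = [
    "[2014-05-28T11:42:06+05:30] error: gres/gpu: step_test 1279.4294967294 gres_bit_alloc is NULL",
    "[2014-05-28T11:42:06+05:30] _slurm_rpc_job_step_create for job 1279: Requested nodes are busy",
]

# Illustrative pattern: "step_test <job id>.<step id>" as it appears in the excerpt.
STEP_RE = re.compile(r"step_test (\d+)\.(\d+)")


def parse_step_ids(lines):
    """Return (job_id, step_id) pairs for every step_test entry found."""
    pairs = []
    for line in lines:
        match = STEP_RE.search(line)
        if match:
            pairs.append((int(match.group(1)), int(match.group(2))))
    return pairs


def describe(step_id):
    """Render a step id in decimal, in hex, and as an offset from 2**32."""
    return f"{step_id} = {step_id:#x} = 2**32 - {2**32 - step_id}"


if __name__ == "__main__":
    for job_id, step_id in parse_step_ids(LOG_LINES):
        print(f"job {job_id}: step id {describe(step_id)}")
```

Run against the two lines above, this reports job 1279 with a step id of 0xfffffffe, two below 2**32.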

 

 

--

Thanks,

Yogendra

 
