[slurm-users] Add new compute node without interruption

455 views
Skip to first unread message

Microbiome Studio

unread,
Dec 13, 2021, 12:56:17 PM12/13/21
to slurm...@lists.schedmd.com
Dear,

Firstly thanks slurm devloper for your amazing works.

We would like to know if it is planned to add this feature: 
Adding new compute node without interruption

Indeed actually we have to stop compution, declare new nodes and resume
the computation. such feature would be really helpfull with the growth
of cloud computation.



Thanks


Best regards


Paul Brunk

unread,
Dec 13, 2021, 2:03:04 PM12/13/21
to Slurm User Community List
Hi:

Normally, adding a new node requires altering slurm.conf, and restarting slurmctld, and slurmd on each node.
Restarting these daemons should not harm jobs and can be done while existing jobs are running.

Wishing that I’d just listened this time,
Paul Brunk, system administrator, Workstation Support Group
GACRC (formerly RCC)
EITS (formerly UCNS)
University of Georgia

-----Original Message-----
From: slurm-users <slurm-use...@lists.schedmd.com> On Behalf Of Microbiome Studio
Sent: Monday, December 13, 2021 12:55
To: slurm...@lists.schedmd.com
Subject: [slurm-users] Add new compute node without interruption

[EXTERNAL SENDER - PROCEED CAUTIOUSLY]

Brian Andrus

unread,
Dec 13, 2021, 2:11:45 PM12/13/21
to slurm...@lists.schedmd.com
Indeed, this is accurate.

We regularly add nodes on the fly (cloud based cluster).

All that is need is to get them all set in the slurm.conf, restart
slurmctld and do 'scontrol reconfigure'


Brian Andrus

Ole Holm Nielsen

unread,
Dec 13, 2021, 2:13:14 PM12/13/21
to slurm...@lists.schedmd.com
On 13-12-2021 18:55, Microbiome Studio wrote:
> We would like to know if it is planned to add this feature:
> Adding new compute node without interruption
>
> Indeed actually we have to stop compution, declare new nodes and resume
> the computation. such feature would be really helpfull with the growth
> of cloud computation.

I've collected some well-known information about adding and removing
nodes here: https://wiki.fysik.dtu.dk/niflheim/SLURM#add-and-remove-nodes

With Slurm we can add or remove nodes without any interruption of jobs.

/Ole

Reply all
Reply to author
Forward
0 new messages