[slurm-users] How to Make AvailableFeatures Persist after Slurmctld Restart

136 views
Skip to first unread message

Hanby, Mike

unread,
Jun 2, 2022, 3:06:20 PM6/2/22
to slurm...@lists.schedmd.com

Howdy,

 

I can’t seem to find a solution in ‘man slurm.conf’ for this. How can I make the following persist a slurmctld restart:

 

scontrol update NodeName="c001" AvailableFeatures=hi_mem,data,scratch

 

NodeName=c001 Arch=x86_64 CoresPerSocket=12

   CPUAlloc=2 CPUTot=48 CPULoad=6.08

   AvailableFeatures=hi_mem,data,scratch

   ActiveFeatures=hi_mem,data,scratch

 

Thanks for any insight,

 

Mike

Brian Andrus

unread,
Jun 2, 2022, 3:11:18 PM6/2/22
to slurm...@lists.schedmd.com

Add it to your slurm.conf

Then it is always there after a restart.

Brian Andrus

Sarlo, Jeffrey S

unread,
Jun 2, 2022, 3:11:26 PM6/2/22
to Slurm User Community List

In slurm.conf, we just add the Features to the node description.  Is that what you were looking for?

 

NodeName=compute-4-4 … Weight=15 Feature=gen10

 

Jeff

UH IT - HPC

Hanby, Mike

unread,
Jun 2, 2022, 4:56:16 PM6/2/22
to Slurm User Community List

Ah, thank you. I was assuming it would use the same name as in scontrol

 

Per ‘man slurm.conf’ :

 

Feature:

A  comma  delimited  list of arbitrary strings indicative of some characteristic associated with the node.  There is no value associated with a feature at this time, a node either has a feature or it does not.  If desired a feature may contain a numeric component indicating, for example, processor speed.  By default a node has no features.  Also see Gres.

 

‘man scontrol’

 

AvailableFeatures=<features>:

Identify the feature(s) available on the specified node.  Any previously defined available feature specification will be overwritten with the new value.  AvailableFeatures assigned via scontrol will only persist across  the restart  of  the slurmctld daemon with the -R option and state files preserved or slurmctld's receipt of a SIGHUP.  Update slurm.conf with any changes meant to be persistent across normal restarts of slurmctld or the execution of scontrol reconfig.

 

Thanks!

Reply all
Reply to author
Forward
0 new messages