Can slurm (slurmctld) run with zero defined partitions in slurm.conf?

Blake Fitch

Nov 29, 2022, 4:02:07 PM
I am seeing the following error when bringing up a cluster before any partitions are defined.

[2022-11-29T21:39:33.595] debug: Reading slurm.conf file: /etc/slurm/slurm.conf
[2022-11-29T21:39:33.596] No memory enforcing mechanism configured.
[2022-11-29T21:39:33.597] topology/none: init: topology NONE plugin loaded
[2022-11-29T21:39:33.597] debug: No DownNodes
[2022-11-29T21:39:33.597] fatal: No PartitionName information available!
[2022-11-29T21:39:33.599] slurmscriptd: debug: _slurmscriptd_mainloop: finished

Does this mean that the slurmctld service cannot be brought up until after partitions are defined?
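
For what it's worth, the condition behind that fatal message could probably be caught before slurmctld is started. The following is only a minimal sketch in Python, not anything that ships with Slurm: the helper name `defined_partitions` and the sample config are made up for illustration. It assumes everything lives in a single slurm.conf (it does not follow Include lines) and that a `PartitionName=DEFAULT` line only sets defaults rather than defining a partition.

```python
import re

# Hypothetical helper, not part of Slurm. Matches a line that begins a
# partition definition, e.g. "PartitionName=debug Nodes=node[01-04]".
_PARTITION_RE = re.compile(r"^\s*PartitionName\s*=\s*(\S+)", re.IGNORECASE)


def defined_partitions(conf_text):
    """Return the partition names defined in the given slurm.conf text."""
    names = []
    for raw_line in conf_text.splitlines():
        # Drop everything after '#', so commented-out partitions are ignored.
        line = raw_line.split("#", 1)[0]
        match = _PARTITION_RE.match(line)
        if match is None:
            continue
        name = match.group(1)
        # Assumption: PartitionName=DEFAULT only sets default values.
        if name.upper() != "DEFAULT":
            names.append(name)
    return names


# Made-up head-node-only config with the partition still commented out.
sample_conf = """\
ClusterName=demo
SlurmctldHost=head01
# PartitionName=debug Nodes=node[01-04] Default=YES
"""

if not defined_partitions(sample_conf):
    print("No partitions defined; slurmctld would likely hit the fatal error.")
```

A check like this could run in a provisioning script so that the head node setup fails early with a clearer message, instead of relying on the slurmctld log.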

I'd like to define a head node first, including its Slurm config, and then define compute resources and add them to the existing config.

Thanks for any insights!


Olivier Martin

Nov 29, 2022, 7:59:34 PM
to Blake Fitch, google-cloud-slurm-discuss
Hi Blake,

One thing you could try is to use the latest Slurm on GCP release (I believe version 5.2) and enable the `enable_reconfigure` field. This sets up a Pub/Sub queue that lets you push changes to your partitions and configuration with Terraform, which would probably let you do what you are looking for.

