Fixing / Adding new partition to an existing cluster

fionn malone

Apr 13, 2022, 6:21:37 PM
to google-cloud-slurm-discuss
Hi,

Is it possible to reconfigure a Slurm cluster to either modify an existing partition or add a new partition?

For example, I have a cluster set up but need to edit the number of CPUs. It seems possible to edit the config.yaml and rerun setup.py, but I'm not sure what the correct workflow is and I don't want to clobber what I have.

Along the same lines, is it possible to add a new partition with, say, A100 GPUs? I've tried to do so separately but couldn't figure out the appropriate configuration:

machine_type = "a2-highgpu-8g",
gpu_type = 8
gpu_type = null
cpu_platform = "Intel Cascade Lake"

This results in a salloc node failure, which I think means the configuration is incorrect.

Thanks,

Fionn.