[slurm-users] How to weight nodes properly based on RAM to prevent small instances clogging big instances with backfilling

7 views
Skip to first unread message

Xaver Stiensmeier via slurm-users

unread,
May 28, 2026, 10:40:53 AMMay 28
to slurm...@lists.schedmd.com
Dearl Slurm User List,

I am currently reviewing a slurm.conf where the developer set Weight
manually to attribute a greater weight to machines that have more RAM to
force smaller jobs on smaller instances. However, I feel like there is
something already in place or better than manually setting the weights,
but I couldn't find it.

If I understand correctly Slurm does not schedule jobs to the smallest
possible node on default. So small jobs can be scheduled to large
instances and a big job might have to wait indefinitely when using
backfilling.

I thought that Slurm does have mechanisms to prevent this but was unable
to find it again in the documentation.

Is there really no automatism at place or am I overlooking something?

Best,
Xaver

OpenPGP_0x2DB91A0E5E88D181.asc
OpenPGP_signature.asc

Davide DelVento via slurm-users

unread,
May 28, 2026, 5:31:09 PMMay 28
to Xaver Stiensmeier, slurm...@lists.schedmd.com
IIRC the thing you can't find is simply the order in which the nodes are listed in the partition definition (or alphanumerical order if they are added en masse with square brackets)

--
slurm-users mailing list -- slurm...@lists.schedmd.com
To unsubscribe send an email to slurm-us...@lists.schedmd.com
Reply all
Reply to author
Forward
0 new messages