[slurm-users] How to implement resource restriction for diffrent slurm users under same slurm account

384 views
Skip to first unread message

Hemanta Sahu

unread,
Oct 17, 2022, 10:53:15 AM10/17/22
to slurm...@lists.schedmd.com
Dear All,

     I want to implement resource restriction for different slurm users under
the same slurm account by setting  "GrpTRESMins" Flag. For testing purposes I set the "GrpTRESMins=cpu=0"  and submitted the Job.

  I expect the job submission should fail but it is not happening. The jobs are still going to Q and running.

Kindly help me if I am missing anything obvious. Command outputs given below for reference.
 

>>
[testfac3@login04 export_bin]$ sacctmgr  modify user name=testuser100 Account=testfac3_imf set GrpTRESMins=cpu=0
 Modified user associations...
  C = param-shakti A = testfac3_imf         U = testuser100
Would you like to commit changes? (You have 30 seconds to decide)
(N/y): y

[testuser100@login04 ~]$ sacctmgr show assoc where Account=testfac3_imf user=testuser100 format=Account%15,User%15,GrpTRESMins,QOS%30
        Account            User   GrpTRESMins                            QOS
--------------- --------------- ------------- ------------------------------
   testfac3_imf     testuser100         cpu=0                   testfac3_imf

[testuser100@login04 testuser100]$ sacctmgr show qos sacctmgr show qos testfac3_imf format=Name%20,MaxWall,Flags%20,GrpTRESMins%20,MaxSubmitJobsPerUser,MaxSubmitJobsPeraccount,GrpTRESRunMin,Priority
                Name     MaxWall                Flags          GrpTRESMins MaxSubmitPU MaxSubmitPA GrpTRESRunMin   Priority
-------------------- ----------- -------------------- -------------------- ----------- ----------- ------------- ----------
        testfac3_imf  3-00:00:00  DenyOnLimit,NoDecay        cpu=210000000         100         500                    10000
[testuser100@login04 testuser100]$

[testuser100@login04 testuser100]$ scontrol show job 949622|grep JobState
   JobState=COMPLETED Reason=None Dependency=(null)
[testuser100@login04 testuser100]$


[testuser100@login04 testuser100]$ cat testjob.sh
#!/bin/bash
#SBATCH -J testjob                # name of the job
#SBATCH -p standard               # name of the partition: available options "standard,standard-low,gpu,gpu-low,hm"
#SBATCH -n 2                      # no of processes
#SBATCH -q testfac3_imf
#SBATCH -A testfac3_imf
#SBATCH -t 01:00:00               # walltime in HH:MM:SS, Max value 72:00:00
#list of modules you want to use, for example
module load compiler/intel-mpi/mpi-2020-v4 compiler/intel/2020.4.304

#name of the executable
exe="uname -n"

#run the application
mpirun -n $SLURM_NTASKS $exe

[testuser100@login04 testuser100]$ sbatch testjob.sh
Submitted batch job 949622

[testuser100@login04 testuser100]$ squeue
             JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
            949622  standard testjob- testuser  R       0:04      2 cn[304-305]
>>

Thanks in advance

Best Regards
Hemanta

Ole Holm Nielsen

unread,
Oct 18, 2022, 1:42:01 AM10/18/22
to slurm...@lists.schedmd.com
On 10/17/22 16:51, Hemanta Sahu wrote:
>      I want to implement resource restriction for different slurm users
> under
> the same slurm account by setting  "GrpTRESMins" Flag. For testing
> purposes I set the "GrpTRESMins=cpu=0"  and submitted the Job.
>
>   I expect the job submission should fail but it is not happening. The
> jobs are still going to Q and running.
>
> Kindly help me if I am missing anything obvious. Command outputs given
> below for reference.

Job submission should not fail due to resource limits.

Read the slurm.conf manual page to make sure you have set this parameter
correctly, for example:

$ scontrol show config | grep AccountingStorageEnforce
AccountingStorageEnforce = associations,limits,qos,safe

You should also read this documentation:
https://slurm.schedmd.com/resource_limits.html

I assume that you have configured a Slurm database?

/Ole

Reply all
Reply to author
Forward
0 new messages