sacctmgr show qos gpu-rtx format=MaxTRESPU%60
MaxTRESPU
-----------------------------------------------------
We have defined a user test that is assoc with this qos
sacctmgr show assoc user=test format=user,qos
Qos
gpu-rtx
Now we define another qos gpu-rtx-reserved that allows gpu=8
sacctmgr show qos gpu-rtx-reserved format=MaxTRESPU%60
MaxTRESPU
-----------------------------------------------------
parrtitionName=gpu-rtx6000-2 State=UP Nodes=g[15-29] MaxNodes=9 MaxTime=168:00:00 DefMemPerCPU=3996 AllowQos=gpu-rtx,gpu-rtx-reserved
But we found out that even though user is not assoc with gpu-rtx-reserved if the user uses gpu-rtx-reserved in his slurm script , he can reserve 8 gpu cards
So our question is , can the users assoc with one partition qos can use the other qos in the partition even if they are not associated with it . or in other words , we can only define one partition qos and not more then one.?
Hope i was able to explain ?
Any advice if we want partition to use more then one qos with different limits and users associated with one qos should not use other qos ?
Majid
| External email: Please exercise caution |
sacctmgr show account withassoc -p | grep gpu-rtx-reserved
default|default|default|uea_cluster||cjr13geu|1|||||||||||||||gpu,gpu-k40-1,gpu-rtx,gpu-rtx-reserved,hmem,ht,uea_def_qos|
sontrol show part gpu-rtx6000-2
PartitionName=gpu-rtx6000-2
AllowGroups=ALL AllowAccounts=ALL AllowQos=gpu-rtx,gpu-rtx-reserved,jakeuea
AllocNodes=ALL Default=NO QoS=N/A
DefaultTime=1-00:00:00 DisableRootJobs=NO ExclusiveUser=NO GraceTime=0 Hidden=NO
MaxNodes=9 MaxTime=7-00:00:00 MinNodes=0 LLN=NO MaxCPUsPerNode=UNLIMITED
Nodes=g[15-29]
PriorityJobFactor=1 PriorityTier=1 RootOnly=NO ReqResv=NO OverSubscribe=NO
OverTimeLimit=NONE PreemptMode=GANG,SUSPEND
State=UP TotalCPUs=720 TotalNodes=15 SelectTypeParameters=NONE
JobDefaults=(null)
DefMemPerCPU=3996 MaxMemPerNode=UNLIMITED
On a different note we have the following in slurm.conf
AccountingStorageUser=slurm
But we have been adding qos and assigning users as root ? Can this be an issue
Amjad