Loris Bennett via slurm-users
Jun 5, 2024, 3:25:54 AM
to Ryan Novosielski via slurm-users, Robert Kudyba, Ryan Novosielski
Ryan Novosielski via slurm-users <slurm...@lists.schedmd.com> writes:

> We do have bf_continue set. And also bf_max_job_user=50, because we discovered that one user can submit so many jobs that it will hit the limit of the number
> it’s going to consider and not run some jobs that it could otherwise run.
>
> On Jun 4, 2024, at 16:20, Robert Kudyba <rku...@fordham.edu> wrote:
>
> Thanks for the quick response Ryan!
>
> Are there any recommendations for bf_ options from
> https://slurm.schedmd.com/sched_config.html that could help with this?
> bf_continue? Decreasing bf_interval= to a value lower than 30?

Your bf_window may be too small. From 'man slurm.conf':

  bf_window=#
      The number of minutes into the future to look when considering
      jobs to schedule. Higher values result in more overhead and
      less responsiveness. A value at least as long as the highest
      allowed time limit is generally advisable to prevent job
      starvation. In order to limit the amount of data managed by
      the backfill scheduler, if the value of bf_window is increased,
      then it is generally advisable to also increase bf_resolution.
      This option applies only to SchedulerType=sched/backfill.
      Default: 1440 (1 day), Min: 1, Max: 43200 (30 days).

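
As a rough sketch of what that could look like in slurm.conf: the
7-day maximum time limit and the bf_resolution value below are only
assumptions for illustration, so substitute the longest time limit
your partitions actually allow. bf_continue and bf_max_job_user=50
are the settings Ryan mentioned above.

```
# Illustrative only: assumes the longest allowed time limit is 7 days.
# bf_window is given in minutes: 7 * 24 * 60 = 10080.
# bf_resolution is given in seconds (default 60); it is raised here to
# limit the amount of data the backfill scheduler manages with the
# larger window. The value 600 is an assumed starting point, not a
# recommendation.
SchedulerType=sched/backfill
SchedulerParameters=bf_continue,bf_max_job_user=50,bf_window=10080,bf_resolution=600
```

The values currently in effect can be checked with
'scontrol show config | grep SchedulerParameters'.
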
--
Dr. Loris Bennett (Herr/Mr)
FUB-IT (ex-ZEDAT), Freie Universität Berlin