We have added a few nodes to the express queue, thus it now has 8 nodes. For now..
Regarding configuration options. The challenge is to find the configuration for an express queue, which has the fewest negative consequences. There might be one better than the current version.
Let's review.
Current configuration:
* Jobs are automatically submitted to both normal and express, if the job has a wall time of 1 hour or less. 8 nodes in the express queue.
(neg) If many jobs are submitted with a wall time of less than 1 hour, then they may block the express queue for some hours.
(pos) 8 nodes available for short jobs, thus real work can be done on the express queue.
Configuration alternatives:
* No jobs are automatically sent to the express queue. 2 nodes in the express queue.
(neg) The nodes in the express queue can not as easily be utilised for computations, thus less nodes are reserved for this queue.
(pos) They will not be blocked by short jobs sent to the normal queue
* Limit the express queue, such that no user can have more than 2 jobs in the queue and no jobs are automatically sent to the express queue. 2 nodes reserved.
(neg) They would likely be unused most of the time. Especially doing non-work hours.
(pos) No user would be able to block the express queue for more than an hour.
* No jobs are automatically sent to the express queue. All jobs sent to express, will additionally be sent to normal. 2 nodes reserved.
(pos) Users blocking the express queue with many jobs, would be doing it intentionally, thus they could be reprimanded.
I prefer the last configuration on this list. What do you think?
/ Rune