On Tue, 2025-07-29 at 08:58 +0200, Ole Holm Nielsen via slurm-users
wrote:
> On 7/29/25 02:17, Christopher Samuel via slurm-users wrote:
> > On 7/28/25 9:58 am, Michael Gutteridge via slurm-users wrote:
> >
> > > I think the events you're looking for would be tracked in the
> > > events
> > > tables in the accounting database:
>
> Thanks, "sacctmgr show event where node=<nodename>" is extremely
> useful
> for monitoring nodes, and I wasn't aware of this command. I've added
> some
> further examples to my Wiki page now at
>
https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_operations/#listing-node-events
Thanks for the replies; I was also not aware of that command and now
feel like I should have read the documentation better! That wiki is
also a nice resource.
> > Be aware that down and drainED nodes are there, but not drainING.
Noted; I think down and drained will give me what I'm looking for. We
do have monitoring of all our cluster that likely has the information
but this gives me something I use quickly from within the cluster etc.
Cheers,
Steve