[slurm-users] Is there a scontrol ping slurmdbd?

493 views
Skip to first unread message

Heitor

unread,
Jun 9, 2021, 5:24:06 PM6/9/21
to slurm...@lists.schedmd.com
Hello,

The docs about scontrol says that the command `scontrol ping` allows
one to query if slurmctld nodes are up and running.

I'm wondering if there's something analogous for slurmdbd? A command to
check if slurmdbd nodes are up and running? I couldn't find it in the
docs.

Kind regards,
Heitor

Christoph Brüning

unread,
Jun 10, 2021, 2:54:42 AM6/10/21
to slurm...@lists.schedmd.com
Hello,

I'd usually use some simple sacctmgr command.
Something like "sacctmgr list cluster".

That said, we're running a single slurmdbd instance since slurmctld does
some caching etc. Do you have multiple that you need to check individually?

Cheers,
Christoph
--
Dr. Christoph Brüning
Universität Würzburg
HPC & DataManagement @ ct.qmat & RZUW
Am Hubland
D-97074 Würzburg
Tel.: +49 931 31-80499

Sean Crosby

unread,
Jun 10, 2021, 3:21:26 AM6/10/21
to slurm...@lists.schedmd.com
We use sacctmgr list stats for our Slurmdbd check

Our Nagios check is

RESULT=$(/usr/local/slurm/latest/bin/sacctmgr list stats)
if [ $? -ne 0 ]
then
        echo "ERROR: cannot connect to database"
        exit 2
fi
echo "$RESULT" | head -n 4
exit 0

Sean

From: slurm-users <slurm-use...@lists.schedmd.com> on behalf of Christoph Brüning <christoph...@uni-wuerzburg.de>
Sent: Thursday, 10 June 2021 16:54
To: slurm...@lists.schedmd.com <slurm...@lists.schedmd.com>
Subject: [EXT] Re: [slurm-users] Is there a scontrol ping slurmdbd?
 
External email: Please exercise caution

Heitor

unread,
Jun 10, 2021, 7:39:24 PM6/10/21
to slurm...@lists.schedmd.com
This was exactly what I needed! Thank you Sean and Christoph!
Reply all
Reply to author
Forward
0 new messages