Hello Everyone,
We would like to improve our visibility on our cluster usage.
We have ganglia, and use sacct actually, but I was wondering if there was a web tool recommended to have both monitoring and accounting (user and admin friendly) ?
Thanks in advance
Christine
Something I have been impressed with is Netdata
It is in the standard repositories and will auto-detect quite a bit of things on a node. It is great for real-time monitoring of a node/job.
I also use Prometheus and Grafana for historic data (anything over 5 minutes).
Brian Andrus
Hello,
Thanks for your feedback.
I’ve tried xdmod, but after a lot of debugging it is still not working, and the support is not very responsive.
If there a more recent feedback on any accounting tool ?
Thanks in advance,
Christine
De : slurm-users <slurm-use...@lists.schedmd.com>
De la part de Davide DelVento
Envoyé : vendredi 5 mai 2023 15:19
À : Slurm User Community List <slurm...@lists.schedmd.com>
Objet : Re: [slurm-users] monitoring and accounting
Hi, we use grafana with influx, it is easy to install and works fine