Dear All,
I am new user, trying out SLURM
Like to check if the SLURM has a GUI/web based management tool also
Thanks
Joseph John
Thanks Ole
I was able to setup the SLURM for 4 nodes and tried out some python code using srun and trying to understand and practice more of SLURM commands
Thanks for the reply
Joseph John
> can you please advice me on the monitoring tools, I
I'm _somehow_ satisfied with:
Prometheus Slurm exporter - (
https://github.com/vpenso/prometheus-slurm-exporter),
being grabbed by Telegraf - (
https://www.influxdata.com/time-series-platform/telegraf )
sending metrics to InfluxDB.
Visualisation is done by Grafana.
HTH and I'll be happy to hear about other ways, especially collectors, providing consistent state of slurm and its partition usage.. Mentioned exporter is not capturing well transitions, and aggregating visualizations have glitches.
But for high-level overview / quantitative view at system it's
enough.
josef