[slurm-users] Tracking efficiency of all jobs on the cluster (dashboard etc.)
Will Furnell - STFC UKRI
will.furnell at stfc.ac.uk
Mon Jul 24 14:37:42 UTC 2023
I am aware of 'seff', which allows you to check the efficiency of a single job, which is good for users, but as a cluster administrator I would like to be able to track the efficiency of all jobs from all users on the cluster, so I am able to 're-educate' users that may be running jobs that have terrible resource usage efficiency.
What do other cluster administrators use for this task? Is there anything you use and recommend (or don't recommend) or have heard of that is able to do this? Even if it's something like a Grafana dashboard that hooks up to the SLURM database,
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the slurm-users