[slurm-users] Tracking efficiency of all jobs on the cluster (dashboard etc.)

Angel de Vicente angel.de.vicente at iac.es
Thu Sep 7 18:15:51 UTC 2023


Hi Will,

Will Furnell - STFC UKRI <will.furnell at stfc.ac.uk> writes:

> That does sound like an interesting solution – yes please would you be
> able to send me (or us if you’re willing to share it to the list)
> through some more information please?
>
> And thank you everyone else that has replied to my email – there’s
> definitely a few solutions I need to look into here!

We also use 'seff', but it gives reliable stats only for jobs that
finished properly (i.e. with state COMPLETED). In our case, we would
also need to collect efficiency stats for jobs that end in TIMEOUT, and
even for those that are CANCELLED.

Do you happen to know of some way to accomplish this?

Many thanks,
-- 
Ángel de Vicente
 Research Software Engineer (Supercomputing and BigData)
 Tel.: +34 922-605-747
 Web.: http://research.iac.es/proyecto/polmag/

 GPG: 0x8BDC390B69033F52