[slurm-users] Set a ramdom offset when starting node health check in SLURM

Bjørn-Helge Mevik b.h.mevik at usit.uio.no
Fri Nov 27 08:35:24 UTC 2020


You can also check out

HealthCheckNodeState=CYCLE

man slurm.conf:

"Rather than running the health check program on all nodes at the same
time, cycle through running on all compute nodes through the course of
the HealthCheckInterval. May be combined with the various node state
options."

-- 
Cheers,
Bjørn-Helge Mevik, dr. scient,
Department for Research Computing, University of Oslo
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 832 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20201127/ef0ee081/attachment.sig>


More information about the slurm-users mailing list