[slurm-users] High log rate on messages like "Node nodeXX has low real_memory size"
b.h.mevik at usit.uio.no
Thu May 12 12:18:44 UTC 2022
Per Lönnborg <perlon at passagen.se> writes:
> is there a way to lower the log rate on error messages in slurmctld for nodes with hardware errors?
You don't say which version of Slurm you are running, but I think this
was changed in 21.08, so the node will only try to register once if it
has too little memory, thus only giving one such message. (The node
will then hva state "inval" in sinfo.)
Bjørn-Helge Mevik, dr. scient,
Department for Research Computing, University of Oslo
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 832 bytes
Desc: not available
More information about the slurm-users