[slurm-users] High log rate on messages like "Node nodeXX has low real_memory size"

Bjørn-Helge Mevik b.h.mevik at usit.uio.no
Thu May 12 12:18:44 UTC 2022


Per Lönnborg <perlon at passagen.se> writes:

> Greetings,

God dag!

> is there a way to lower the log rate on error messages in slurmctld for nodes with hardware errors? 

You don't say which version of Slurm you are running, but I think this
was changed in 21.08, so the node will only try to register once if it
has too little memory, thus only giving one such message.  (The node
will then hva state "inval" in sinfo.)

-- 
Cheers,
Bjørn-Helge Mevik, dr. scient,
Department for Research Computing, University of Oslo

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 832 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20220512/309208da/attachment.sig>


More information about the slurm-users mailing list