[slurm-users] Nodes stuck in drain state
Roger Mason
rmason at mun.ca
Thu May 25 17:20:00 UTC 2023
Hello,
"Groner, Rob" <rug262 at psu.edu> writes:
> A quick test to see if it's a configuration error is to set
> config_overrides in your slurm.conf and see if the node then responds
> to scontrol update.
Thanks to all who helped. It turned out that memory was the issue. I
have now reseated the RAM in the offending node and all seems well.
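For anyone hitting the same symptom, here is a minimal sketch of the test suggested in the quoted reply. The node name node01 and the RealMemory value are hypothetical and do not come from this thread. With config_overrides set under SlurmdParameters, the controller trusts the node definition in slurm.conf, so a node that registers with less memory than configured is not marked invalid:

```
# slurm.conf fragment (sketch; node01 and the RealMemory value are hypothetical)
# Testing aid only: trust the configured node resources over what slurmd detects.
SlurmdParameters=config_overrides
# RealMemory is in megabytes; compare it with what "slurmd -C" prints on the node.
NodeName=node01 RealMemory=64000
```

Once the daemons have picked up the change, "scontrol show node node01" shows the recorded drain reason, and "scontrol update NodeName=node01 State=RESUME" attempts to return the node to service. If the node resumes only while config_overrides is set, the likely cause is a mismatch between the configured and detected hardware, as with the memory problem here, and the option should be removed again once the real fault is fixed.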
I have another node also stuck in drain that I will investigate. I
picked up some useful tips from the replies, but if I can't get it back
on-line I hope the friendly people on this list will rescue me.
Thanks again,
Roger