[slurm-users] Can't get node out of drain state

Alex Chekholko alex at calicolabs.com
Thu Jan 23 21:31:21 UTC 2020


Hey Dean,

Does 'scontrol show node <nodename' give any "Reason:"?  You can also look
at 'sinfo -R'.

Make sure the relevant network ports are open:
https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#configure-firewall-for-slurm-daemons

Also check that slurmd daemons on the compute nodes can talk to each other
(not just to the master). e.g. bottom of
https://slurm.schedmd.com/big_sys.html

Regards,
Alex

On Thu, Jan 23, 2020 at 1:05 PM Dean Schulze <dean.w.schulze at gmail.com>
wrote:

> I've tried the normal things with scontrol (
> https://blog.redbranch.net/2015/12/26/resetting-drained-slurm-node/), but
> I have a node that will not come out of the drain state.
>
> I've also done a hard reboot and tried again.  Are there any other
> remedies?
>
> Thanks.
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200123/8a858ccf/attachment.htm>


More information about the slurm-users mailing list