[slurm-users] Nodes do not return to service after scontrol reboot
Chris Samuel
chris at csamuel.org
Thu Jun 18 06:35:50 UTC 2020
On 17/6/20 11:32 pm, David Baker wrote:
> Thank you for your comments. The scontrol reboot command is now working
> as expected.
Fantastic!
For those who don't know, using scontrol reboot in this way also allows
Slurm to take these rebooting nodes into account for scheduling; so if
you have a large job needing a lot of nodes waiting to begin with high
priority and you need to reboot some nodes then Slurm won't give up on
them and put smaller jobs on the system on all the other nodes, delaying
the larger job for no good reason.
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
More information about the slurm-users
mailing list