On 5/17/24 05:16, Ratnasamy, Fritz via slurm-users wrote:
What is the "official" process to remove nodes safely? I have drained the nodes so jobs are completed and put them in down state after they are completely drained. I edited the slurm.conf file to remove the nodes. After some time, I can see that the nodes were removed from the partition with the command sinfo
However, I was told I might need to restart the service slurmctld, do you know if it is necessary? Should I also run scontrol reconfig?
The SchedMD presentations in https://slurm.schedmd.com/publications.html describe node add/remove.
I've collected my notes on this in the Wiki page https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_operations/#add-and-remove-n...
/Ole