[slurm-users] SLURM: reconfig

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Thu May 5 14:43:56 UTC 2022



On 5/5/22 16:08, Mark Dixon wrote:
> On Thu, 5 May 2022, Ole Holm Nielsen wrote:
> ...
>> That is correct.  Just do "scontrol reconfig" on the slurmctld server.  If
>> all your slurmd's are truly running Configless[1], they will pick up the
>> new config and reconfigure without restarting.
>>
>> Details are summarized in
>> https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#reconfiguration-of-slurm-conf. 
>>
>> Beware that you can't add or remove nodes without restarting.  Also,
>> changing certain slurm.conf parameters require restarting.
> ...
> 
> However...
> 
> Given that the normal recommendation for adding/removing nodes safely is to:
> 
> * stop slurmctld
> * edit slurm.conf etc.
> * restart the slurmd nodes to pick up new slurm.conf
> * start slurmctld
> 
> I'm confused how this is supposed to be achieved in a configless setting, 
> as slurmctld isn't running to distribute the updated files to slurmd.

You're right, probably the correct order for Configless must be:

* stop slurmctld
* edit slurm.conf etc.
* start slurmctld
* restart the slurmd nodes to pick up new slurm.conf

See also slides 29-34 in 
https://slurm.schedmd.com/SLUG21/Field_Notes_5.pdf from the Slurm 
publications site https://slurm.schedmd.com/publications.html

Less-Safe, but usually okay, procedure:
1. Change configs
2. Restart slurmctld
3. Restart all slurmd processes really quickly


/Ole




More information about the slurm-users mailing list