[slurm-users] Nodes are down after 2-3 minutes.

Chris Samuel chris at csamuel.org
Mon May 7 16:35:50 MDT 2018


On Tuesday, 8 May 2018 8:21:46 AM AEST Eric F. Alemany wrote:

> copied the /etc/munge/munge.key from the master to all the nodes.
> Checked the date on master and nodes - OK
> 
> systemctl restart slurmctld  on master
> 
> systemctl restart slurmd on all nodes

Did you restart munged as well?  That's what's reading the key, not Slurm.

Munge is just an external service that Slurm talks to.

cheers,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC




More information about the slurm-users mailing list