[slurm-users] Nodes are down after 2-3 minutes.

Eric F. Alemany ealemany at stanford.edu
Mon May 7 16:38:47 MDT 2018


Hi Chris

I thought i did but I will do it again

Best,
Eric
_____________________________________________________________________________________________________

Eric F.  Alemany
System Administrator for Research

Division of Radiation & Cancer  Biology
Department of Radiation Oncology

Stanford University School of Medicine
Stanford, California 94305

Tel:1-650-498-7969<tel:1-650-498-7969>  No Texting
Fax:1-650-723-7382<tel:1-650-723-7382>



On May 7, 2018, at 3:35 PM, Chris Samuel <chris at csamuel.org<mailto:chris at csamuel.org>> wrote:

On Tuesday, 8 May 2018 8:21:46 AM AEST Eric F. Alemany wrote:

copied the /etc/munge/munge.key from the master to all the nodes.
Checked the date on master and nodes - OK

systemctl restart slurmctld  on master

systemctl restart slurmd on all nodes

Did you restart munged as well?  That's what's reading the key, not Slurm.

Munge is just an external service that Slurm talks to.

cheers,
Chris
--
Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180507/02f1c096/attachment.html>


More information about the slurm-users mailing list