[slurm-users] network/communication failure
Eric F. Alemany
ealemany at stanford.edu
Mon May 21 09:31:51 MDT 2018
I had the same issue, even though the system clocks appeared to match on the master and execute nodes. I was told to try configuring NTP (Network Time Protocol) anyway.
That did the trick for me.
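For anyone who lands on this thread later: one way to put a number on "the clocks are the same" is to compare epoch seconds across the nodes. Below is a minimal sketch, assuming the readings were collected by hand (for example with `date +%s` on each node). The host names come from the original post quoted below; the readings are made-up illustration values, and `clock_skew` is a hypothetical helper, not part of Slurm, munge, or NTP.

```python
def clock_skew(readings):
    """Return (skew_seconds, earliest_host, latest_host) for {host: epoch_seconds}."""
    if len(readings) < 2:
        raise ValueError("need readings from at least two hosts")
    earliest = min(readings, key=readings.get)
    latest = max(readings, key=readings.get)
    return readings[latest] - readings[earliest], earliest, latest


if __name__ == "__main__":
    # Made-up readings, e.g. collected by running `date +%s` on each node.
    readings = {
        "triumph01": 1526916711,
        "triumph02": 1526916711,
        "triumph03": 1526916714,
    }
    skew, earliest, latest = clock_skew(readings)
    print(f"skew: {skew}s ({earliest} is behind {latest})")
```

Readings taken by hand also include the delay between running the command on each node, so treat a small difference as noise. A clock that looks right today can still drift, which is presumably why running NTP was suggested.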
Eric F. Alemany
System Administrator for Research
Division of Radiation & Cancer Biology
Department of Radiation Oncology
Stanford University School of Medicine
Stanford, California 94305
Tel: 1-650-498-7969 (No Texting)
On May 21, 2018, at 08:26, Miguel Gutiérrez Páez <mgutierrez at gmail.com> wrote:
SELinux? What does getenforce report?
On Mon, May 21, 2018 at 17:17, Fulcomer, Samuel <samuel_fulcomer at brown.edu> wrote:
Is there a firewall turned on? What does "iptables -L -v" report on the three hosts?
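As a complement to reading the iptables rules, you can test directly whether a TCP connection gets through. This is a rough sketch, not an official Slurm tool: `port_open` is a hypothetical helper, and the ports in the usage line assume the stock defaults (SlurmctldPort=6817, SlurmdPort=6818), so check slurm.conf if they were changed.

```python
import socket
import sys


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # Usage (assumed default ports): python portcheck.py triumph02:6818 triumph03:6818
    targets = sys.argv[1:]
    if not targets:
        print("usage: portcheck.py host:port [host:port ...]")
    for target in targets:
        host, port = target.rsplit(":", 1)
        state = "reachable" if port_open(host, int(port)) else "NOT reachable"
        print(f"{host}:{port} {state}")
```

Run it from the master against the slurmd port on each host, and from each host against the slurmctld port on the master, since a firewall may block only one direction. A "NOT reachable" result while the daemon is running points at a firewall or routing problem rather than at Slurm itself.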
On Mon, May 21, 2018 at 11:05 AM, Turner, Heath <Hturner at eng.ua.edu> wrote:
If anyone has advice, I would really appreciate it.
I am running (just installed) slurm-11.17.6, with a master + 2 hosts. It works locally on the master (controller + execution). However, I cannot establish communication from master [triumph01] with the 2 hosts [triumph02,triumph03]. Here is some more info:
1. munge is running, and munge verification tests all pass.
2. system clocks are in sync on master/hosts.
3. identical slurm.conf files are on master/hosts.
4. the resource configuration (memory/CPUs/etc.) is correct and has been confirmed on all machines (all hardware is identical).
5. I have attached:
b) log file from master slurmctld
c) log file from host slurmd
Any ideas about what to try next?
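Item 3 in the list above (identical slurm.conf files) can be confirmed mechanically instead of by eye. Here is a minimal sketch, assuming each node's slurm.conf has first been copied to the local machine under a distinct name. The file names in the usage comment are hypothetical, and `file_digest` and `group_by_digest` are illustrative helpers, not Slurm utilities.

```python
import hashlib
import sys
from pathlib import Path


def file_digest(path):
    """Return the SHA-256 hex digest of the file at path."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def group_by_digest(paths):
    """Map each digest to the list of paths whose contents hash to it."""
    groups = {}
    for path in paths:
        groups.setdefault(file_digest(path), []).append(str(path))
    return groups


if __name__ == "__main__":
    # Usage (hypothetical file names):
    #   python confcheck.py slurm.conf.triumph01 slurm.conf.triumph02 slurm.conf.triumph03
    paths = sys.argv[1:]
    if not paths:
        print("usage: confcheck.py FILE [FILE ...]")
    else:
        groups = group_by_digest(paths)
        if len(groups) == 1:
            print("all copies are identical")
        else:
            for digest, members in groups.items():
                print(digest[:12], ", ".join(members))
```

More than one group in the output means at least one node has a different file. Even a whitespace-only difference changes the digest, so a mismatch here is a prompt to diff the files, not proof of a functional difference.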
Chemical and Biological Engineering
University of Alabama
3448 SEC, Box 870203
Tuscaloosa, AL 35487
(205) 348-1733 (phone)
(205) 561-7450 (cell)
(205) 348-7558 (fax)
hturner at eng.ua.edu