[slurm-users] network/communication failure
Fulcomer, Samuel
samuel_fulcomer at brown.edu
Mon May 21 09:14:32 MDT 2018
Is there a firewall turned on? What does "iptables -L -v" report on the
three hosts?
On Mon, May 21, 2018 at 11:05 AM, Turner, Heath <Hturner at eng.ua.edu> wrote:
> If anyone has advice, I would really appreciate...
>
> I am running (just installed) slurm-11.17.6, with a master + 2 hosts. It
> works locally on the master (controller + execution). However, I cannot
> establish communication from master [triumph01] with the 2 hosts
> [triumph02,triumph03]. Here is some more info:
>
> 1. munge is running, and munge verification tests all pass.
> 2. system clocks are in sync on master/hosts.
> 3. identical slurm.conf files are on master/hosts.
> 4. configuration of resources (memory/cpus/etc) are correct and have been
> confirmed on all machines (all hardware is identical).
> 5. I have attached:
> a) slurm.conf
> b) log file from master slurmctld
> c) log file from host slurmd
>
> Any ideas about what to try next?
>
> Heath Turner
>
> Professor
> Graduate Coordinator
> Chemical and Biological Engineering
> http://che.eng.ua.edu
>
> University of Alabama
> 3448 SEC, Box 870203
> Tuscaloosa, AL 35487
> (205) 348-1733 (phone)
> (205) 561-7450 (cell)
> (205) 348-7558 (fax)
> hturner at eng.ua.edu
> http://turnerresearchgroup.ua.edu
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180521/c6dcb7b9/attachment-0001.html>
More information about the slurm-users
mailing list