[slurm-users] network/communication failure

Fulcomer, Samuel samuel_fulcomer at brown.edu
Mon May 21 09:14:32 MDT 2018


Is there a firewall turned on? What does "iptables -L -v" report on the
three hosts?
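
If the iptables output is hard to interpret, a direct connection test can show whether the Slurm ports are reachable at all. The sketch below is only an illustration: it assumes the default ports (6817 for slurmctld, 6818 for slurmd), which the attached slurm.conf may override, and the helper names port_open and check_hosts are made up for this example.

```python
import socket

# Assumed defaults for SlurmctldPort and SlurmdPort; change these if
# slurm.conf sets different values.
SLURMCTLD_PORT = 6817
SLURMD_PORT = 6818


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def check_hosts(hosts, port, timeout=3.0):
    """Map each host name to whether the given TCP port is reachable."""
    return {host: port_open(host, port, timeout) for host in hosts}
```

Run from triumph01, check_hosts(["triumph02", "triumph03"], SLURMD_PORT) would show whether slurmd can be reached on each host; run from a compute host, port_open("triumph01", SLURMCTLD_PORT) checks the opposite direction. A False result with slurmd or slurmctld known to be running would be consistent with a firewall dropping the traffic, though it does not prove it.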

On Mon, May 21, 2018 at 11:05 AM, Turner, Heath <Hturner at eng.ua.edu> wrote:

> If anyone has advice, I would really appreciate...
>
> I am running (just installed) slurm-17.11.6, with a master + 2 hosts.  It
> works locally on the master (controller + execution).  However, I cannot
> establish communication from master [triumph01] with the 2 hosts
> [triumph02,triumph03].  Here is some more info:
>
> 1. munge is running, and munge verification tests all pass.
> 2. system clocks are in sync on master/hosts.
> 3. identical slurm.conf files are on master/hosts.
> 4. configuration of resources (memory/cpus/etc) is correct and has been
> confirmed on all machines (all hardware is identical).
> 5. I have attached:
>         a) slurm.conf
>         b) log file from master slurmctld
>         c) log file from host slurmd
>
> Any ideas about what to try next?
>
> Heath Turner
>
> Professor
> Graduate Coordinator
> Chemical and Biological Engineering
> http://che.eng.ua.edu
>
> University of Alabama
> 3448 SEC, Box 870203
> Tuscaloosa, AL  35487
> (205) 348-1733 (phone)
> (205) 561-7450 (cell)
> (205) 348-7558 (fax)
> hturner at eng.ua.edu
> http://turnerresearchgroup.ua.edu
>
>