[slurm-users] srun problem -- Can't find an address, check slurm.conf
Paul Edmon
pedmon at cfa.harvard.edu
Wed Nov 7 08:24:01 MST 2018
Yeah, these are frustrating ones to troubleshoot. When I have seen this
in the past it was usually a missing forward or reverse in DNS that
cause the problem. You could try dialing up the verbosity all the way
and see what you can spot. Else I might recommend dropping a ticket
into the SchedMD guys to see if they have any more insight. Then again
some one on this list might have seen the same issue.
-Paul Edmon-
On 11/7/18 10:20 AM, Scott Hazelhurst wrote:
> Thanks, Paul, yes, it does seem a likely cause, but I can’t see the problem. All machines have the same /etc/hosts file and the worker nodes are just listed one after each other. I’ve checked that the problem nodes are there — no obvious difference. I’ve checked that the IP address is correct.
>
> Moreover, I can ping and ssh either using the node name (e.g. n38) or the fqdn
>
> Scott
>
>
>
>
>> On 07 Nov 2018, at 16:57, Paul Edmon <pedmon at cfa.harvard.edu> wrote:
>>
>> This smacks of either the submission host, the destination host, or the master not being able to resolve the name to an IP. I would triple check that to ensure that resolution is working.
>>
>> -Paul Edmon-
> This communication is intended for the addressee only. It is confidential. If you have received this communication in error, please notify us immediately and destroy the original message. You may not copy or disseminate this communication without the permission of the University. Only authorised signatories are competent to enter into agreements on behalf of the University and recipients are thus advised that the content of this message may not be legally binding on the University and may contain the personal views and opinions of the author, which are not necessarily the views and opinions of The University of the Witwatersrand, Johannesburg. All agreements between the University and outsiders are subject to South African Law unless the University agrees in writing to the contrary.
More information about the slurm-users
mailing list