[slurm-users] [EXTERNAL] Re: trying to diagnose a connectivity issue between the slurmctld process and the slurmd nodes
mercan
ahmet.mercan at uhem.itu.edu.tr
Mon Nov 30 16:33:49 UTC 2020
Hi;
Did you test munge connection? If not, would you test it like this
munge -n | ssh SRVGRIDSLURM02 unmunge
Ahmet M.
30.11.2020 14:43 tarihinde Steve Bland yazdı:
> Thanks Diego
>
> actually, nothing at all in the hosts file, did not seem to need to
> modify it to see the nodes.
> the different case on one of the nodes was an experiment to see if the
> names were in fact case-sensitive
>
> but all networking functions between the nodes, with say munge, all
> seem to work
>
> just not slurmctld taking to the nodes, even though chatter can be
> seen between them in the log with a higher log level set
>
>
>
>
> *Steve Bland*
> /Technical Product Manager///
>
> /Third Party Products/
> Ross Video | Production Technology Experts
> T: +1 (613) 228-0688 ext.4219
> www.rossvideo.com <http://www.rossvideo.com/>
>
> ------------------------------------------------------------------------
> *From:* Diego Zuccato <diego.zuccato at unibo.it>
> *Sent:* 30 November 2020 02:20
> *To:* Slurm User Community List <slurm-users at lists.schedmd.com>; Steve
> Bland <sbland at rossvideo.com>
> *Subject:* Re: [slurm-users] [EXTERNAL] Re: trying to diagnose a
> connectivity issue between the slurmctld process and the slurmd nodes
> Il 27/11/20 17:18, Steve Bland ha scritto:
>
> > NodeName=SRVGRIDSLURM01 NodeAddr=192.168.1.60 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> > NodeName=SRVGRIDSLURM02 NodeAddr=192.168.1.61 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> > NodeName=srvgridslurm03 NodeAddr=192.168.1.62 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> The only issue I see here is that Slurm is case-sensitive. Maybe you
> have case-different names for the nodes in your /etc/hosts ?
> Just guessing, tho.
>
> --
> Diego Zuccato
> DIFA - Dip. di Fisica e Astronomia
> Servizi Informatici
> Alma Mater Studiorum - Università di Bologna
> V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
> tel.: +39 051 20 95786
> ----------------------------------------------
>
> This e-mail and any attachments may contain information that is
> confidential to Ross Video.
>
> If you are not the intended recipient, please notify me immediately by
> replying to this message. Please also delete all copies. Thank you.
More information about the slurm-users
mailing list