[slurm-users] [EXTERNAL] Re: trying to diagnose a connectivity issue between the slurmctld process and the slurmd nodes

mercan ahmet.mercan at uhem.itu.edu.tr
Mon Nov 30 16:33:49 UTC 2020


Hi;


Did you test munge connection? If not, would you test it like this


munge -n | ssh  SRVGRIDSLURM02  unmunge


Ahmet M.



30.11.2020 14:43 tarihinde Steve Bland yazdı:
> Thanks Diego
>
> actually, nothing at all in the hosts file, did not seem to need to 
> modify it to see the nodes.
> the different case on one of the nodes was an experiment to see if the 
> names were in fact case-sensitive
>
> but all networking functions between the nodes, with say munge, all 
> seem to work
>
> just not slurmctld taking to the nodes, even though chatter can be 
> seen between them in the log with a higher log level set
>
>
>
>
> *Steve Bland*
> /Technical Product Manager///
>
> /Third Party Products/
> Ross Video | Production Technology Experts
> T: +1 (613) 228-0688 ext.4219
> www.rossvideo.com <http://www.rossvideo.com/>
>
> ------------------------------------------------------------------------
> *From:* Diego Zuccato <diego.zuccato at unibo.it>
> *Sent:* 30 November 2020 02:20
> *To:* Slurm User Community List <slurm-users at lists.schedmd.com>; Steve 
> Bland <sbland at rossvideo.com>
> *Subject:* Re: [slurm-users] [EXTERNAL] Re: trying to diagnose a 
> connectivity issue between the slurmctld process and the slurmd nodes
> Il 27/11/20 17:18, Steve Bland ha scritto:
>
> > NodeName=SRVGRIDSLURM01 NodeAddr=192.168.1.60 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> > NodeName=SRVGRIDSLURM02 NodeAddr=192.168.1.61 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> > NodeName=srvgridslurm03 NodeAddr=192.168.1.62 CPUs=4 Boards=1
> > SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> The only issue I see here is that Slurm is case-sensitive. Maybe you
> have case-different names for the nodes in your /etc/hosts ?
> Just guessing, tho.
>
> -- 
> Diego Zuccato
> DIFA - Dip. di Fisica e Astronomia
> Servizi Informatici
> Alma Mater Studiorum - Università di Bologna
> V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
> tel.: +39 051 20 95786
> ----------------------------------------------
>
> This e-mail and any attachments may contain information that is 
> confidential to Ross Video.
>
> If you are not the intended recipient, please notify me immediately by 
> replying to this message. Please also delete all copies. Thank you. 



More information about the slurm-users mailing list