[slurm-users] SlurmcltdHost confusion

Jackson, Gary L. Gary.Jackson at jhuapl.edu
Wed Dec 13 20:16:39 UTC 2023


The SlurmctldHost value is set like the following in my slurm.conf:

 

SlurmctldHost=host0,host1

 

That seems to be legal according to the documentation. However, I get error messages like the following:

 

$ srun id

srun: error: get_addr_info: getaddrinfo() failed: Name or service not known

srun: error: slurm_set_addr: Unable to resolve "host0,host1"

srun: error: Unable to establish control machine address

srun: error: Unable to allocate resources: Address already in use

 

If I try to put IP addresses in parentheses per the documentation, I get different errors:

 

$ srun id

srun: error: Bad value "host0(12.34.56.78),host1" for SlurmctldHost

srun: error: No SlurmctldHost defined.

srun: fatal: Unable to process configuration file

 

If I put a single hostname, or a hostname with an address in parentheses as the value for SlurmctldHost, it works fine but I have no failover.

 

I’m running 23.02.6:

 

$ sinfo --version

slurm 23.02.6

 

What’s going on?

 

-- 

Gary

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20231213/c8e62d55/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 6200 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20231213/c8e62d55/attachment.bin>


More information about the slurm-users mailing list