Hello all!
I have an odd question.
On my headnode, also my login nodes, I can ping my outside network and inside network using any DNS name I choose. It is dual homed. When I run squeue, sinfo, or any slurm command as my regular account I get the below error. I can run srun, squeue, sinfo, or any slurm command and it works as expected. Does anyone know what is going on? I feel like this is a permissions issue on something.
On the compute nodes I have no problem running slurm commands as my regular account or root. Everything works well on my compute nodes.
Error:
[user.name@headnode ~]$ squeue
squeue: error: resolve_ctls_from_dns_srv: res_nsearch error: Hostname lookup failure
squeue: error: fetch_config: DNS SRV lookup failed
squeue: error: _establish_config_source: failed to fetch config
squeue: fatal: Could not establish a configuration source
Thanks!
Kent Hanson
Is your regular user unable to read the slurm.conf? How is the cluster set up to get the hostname of the Slurm controller?
I did find the error. It was a missing x permission on a folder. ________________________________ From: Laura Hild lsh@jlab.org Sent: Wednesday, February 5, 2025 2:23:58 PM To: Kent L. Hanson Kent.Hanson@inl.gov Cc: 'slurm-users@schedmd.com' slurm-users@schedmd.com Subject: Re: Issue running slurm commands as normal account but work as root.
Is your regular user unable to read the slurm.conf? How is the cluster set up to get the hostname of the Slurm controller?