Issue running slurm commands as normal account but work as root.
Hello all! I have an odd question. On my headnode, also my login nodes, I can ping my outside network and inside network using any DNS name I choose. It is dual homed. When I run squeue, sinfo, or any slurm command as my regular account I get the below error. I can run srun, squeue, sinfo, or any slurm command and it works as expected. Does anyone know what is going on? I feel like this is a permissions issue on something. On the compute nodes I have no problem running slurm commands as my regular account or root. Everything works well on my compute nodes. Error: [user.name@headnode ~]$ squeue squeue: error: resolve_ctls_from_dns_srv: res_nsearch error: Hostname lookup failure squeue: error: fetch_config: DNS SRV lookup failed squeue: error: _establish_config_source: failed to fetch config squeue: fatal: Could not establish a configuration source Thanks! Kent Hanson
Is your regular user unable to read the slurm.conf? How is the cluster set up to get the hostname of the Slurm controller?
I did find the error. It was a missing x permission on a folder. ________________________________ From: Laura Hild <lsh@jlab.org> Sent: Wednesday, February 5, 2025 2:23:58 PM To: Kent L. Hanson <Kent.Hanson@inl.gov> Cc: 'slurm-users@schedmd.com' <slurm-users@schedmd.com> Subject: Re: Issue running slurm commands as normal account but work as root. Is your regular user unable to read the slurm.conf? How is the cluster set up to get the hostname of the Slurm controller?
participants (2)
-
Kent L. Hanson -
Laura Hild