[slurm-users] Unable to contact slurm controller

Mahmood Naderan mahmood.nt at gmail.com
Tue Jul 31 09:34:43 MDT 2018


Hi,
It seems that squeue is broken due to the following error:

[root at rocks7 ~]# squeue
slurm_load_jobs error: Unable to contact slurm controller (connect failure)
[root at rocks7 ~]#  systemctl restart slurmd
[root at rocks7 ~]#  systemctl restart slurmctld
[root at rocks7 ~]# squeue
slurm_load_jobs error: Unable to contact slurm controller (connect failure)
[root at rocks7 ~]# ps aux | grep slurm
root      2969  0.0  0.0 343112  3268 ?        Sl   Jul07   0:12
/usr/sbin/slurmdbd
kouhika+ 22930  0.0  0.0   4348   348 pts/2    S+   Jul30   0:00
/usr/libexec/slurm-spank-x11 -t compute-0-6 -i 803.0 -cgw -s ssh -o
kouhika+ 22931  9.7  0.0 192296 20292 pts/2    S+   Jul30 145:28 ssh -Y
compute-0-6 /usr/libexec/slurm-spank-x11 -i 803.0 -c -g -w -s "ssh" -o ""
root     28532  0.0  0.0 143132  2072 ?        Sl   20:02   0:00
/usr/sbin/slurmd
root     29364  0.0  0.0 112712   964 pts/12   S+   20:03   0:00 grep
--color=auto slurm


As you can see I tried to restart slurm processes, however, has no effect.
Any thought?


Regards,
Mahmood
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180731/173546bb/attachment.html>


More information about the slurm-users mailing list