[slurm-users] Unable to contact slurm controller

Alex Chekholko alex at calicolabs.com
Tue Jul 31 11:02:39 MDT 2018


Seems like your slurmctld is not running.  Have you checked its log to see
why?

On Tue, Jul 31, 2018 at 8:35 AM Mahmood Naderan <mahmood.nt at gmail.com>
wrote:

> Hi,
> It seems that squeue is broken due to the following error:
>
> [root at rocks7 ~]# squeue
> slurm_load_jobs error: Unable to contact slurm controller (connect failure)
> [root at rocks7 ~]#  systemctl restart slurmd
> [root at rocks7 ~]#  systemctl restart slurmctld
> [root at rocks7 ~]# squeue
> slurm_load_jobs error: Unable to contact slurm controller (connect failure)
> [root at rocks7 ~]# ps aux | grep slurm
> root      2969  0.0  0.0 343112  3268 ?        Sl   Jul07   0:12
> /usr/sbin/slurmdbd
> kouhika+ 22930  0.0  0.0   4348   348 pts/2    S+   Jul30   0:00
> /usr/libexec/slurm-spank-x11 -t compute-0-6 -i 803.0 -cgw -s ssh -o
> kouhika+ 22931  9.7  0.0 192296 20292 pts/2    S+   Jul30 145:28 ssh -Y
> compute-0-6 /usr/libexec/slurm-spank-x11 -i 803.0 -c -g -w -s "ssh" -o ""
> root     28532  0.0  0.0 143132  2072 ?        Sl   20:02   0:00
> /usr/sbin/slurmd
> root     29364  0.0  0.0 112712   964 pts/12   S+   20:03   0:00 grep
> --color=auto slurm
>
>
> As you can see I tried to restart slurm processes, however, has no effect.
> Any thought?
>
>
> Regards,
> Mahmood
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180731/7eeb7a88/attachment-0001.html>


More information about the slurm-users mailing list