[slurm-users] Intermittent "Not responding" status

Chris Samuel chris at csamuel.org
Mon Dec 4 15:47:56 MST 2017


On Tuesday, 5 December 2017 5:57:59 AM AEDT Stradling, Alden Reid (ars9ac) 
wrote:

> I have a number of nodes that have, after our transition to Centos 7.3/SLURM
> 17.02, begun to occasionally display a status of "Not responding".

I'd suggest checking in your slurmd and slurmctld logs to see if anything 
useful is there.   Also check the system logs for messages about host 
unreachables or network storage issues too.

Good luck,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC




More information about the slurm-users mailing list