[slurm-users] Intermittent "Not responding" status
Chris Samuel
chris at csamuel.org
Mon Dec 4 15:47:56 MST 2017
On Tuesday, 5 December 2017 5:57:59 AM AEDT Stradling, Alden Reid (ars9ac)
wrote:
> I have a number of nodes that have, after our transition to Centos 7.3/SLURM
> 17.02, begun to occasionally display a status of "Not responding".
I'd suggest checking your slurmd and slurmctld logs to see if anything
useful is there. Also check the system logs for messages about
unreachable hosts or network storage issues.
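
If it helps, here is a rough sketch of that kind of log check in Python.
The keyword list is only my guess at what might be worth looking for, not
an authoritative set of Slurm or kernel messages, and the function names
are made up for illustration. The log locations are site-specific (see
SlurmdLogFile and SlurmctldLogFile in your slurm.conf), so pass in
whatever paths apply on your systems.

```python
import re
from typing import Iterable, List

# Assumed keywords only; tune these to what your own logs actually contain.
PATTERNS = re.compile(
    r"not responding|unreachable|timed out|stale file handle",
    re.IGNORECASE,
)


def suspicious_lines(lines: Iterable[str]) -> List[str]:
    """Return the lines that match any of the assumed keywords."""
    return [line.rstrip("\n") for line in lines if PATTERNS.search(line)]


def scan_file(path: str) -> List[str]:
    """Scan one log file; undecodable bytes are replaced, not fatal."""
    with open(path, errors="replace") as fh:
        return suspicious_lines(fh)
```

Calling scan_file() on the slurmd log of an affected node, the slurmctld
log on the controller, and the system log should give you a short list of
lines to compare against the times the nodes were flagged.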
Good luck,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC