[slurm-users] Intermittent "Not responding" status
    Chris Samuel 
    chris at csamuel.org
       
    Mon Dec  4 15:47:56 MST 2017
    
    
  
On Tuesday, 5 December 2017 5:57:59 AM AEDT Stradling, Alden Reid (ars9ac) 
wrote:
> I have a number of nodes that have, after our transition to Centos 7.3/SLURM
> 17.02, begun to occasionally display a status of "Not responding".
I'd suggest checking in your slurmd and slurmctld logs to see if anything 
useful is there.   Also check the system logs for messages about host 
unreachables or network storage issues too.
Good luck,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC
    
    
More information about the slurm-users
mailing list