[slurm-users] Nodes stuck in drain state
    Roger Mason 
    rmason at mun.ca
       
    Thu May 25 14:30:37 UTC 2023
    
    
  
Ole Holm Nielsen <Ole.H.Nielsen at fysik.dtu.dk> writes:
> 1. Is slurmd running on the node?
Yes.
> 2. What's the output of "slurmd -C" on the node?
NodeName=node012 CPUs=4 Boards=1 SocketsPerBoard=2 CoresPerSocket=2
ThreadsPerCore=1 RealMemory=6097
> 3. Define State=UP in slurm.conf instead of UNKNOWN
Will do.
> 4. Why have you configured TmpDisk=0?  It should be the size of the
> /tmp filesystem.
I have not configured TmpDisk.  This is the entry in slurm.conf for that
node:
NodeName=node012 CPUs=4 Boards=1 SocketsPerBoard=2 CoresPerSocket=2
ThreadsPerCore=1 RealMemory=10193  State=UNKNOWN
But I do notice that slurmd -C now reports RealMemory=6097, which is less
than the RealMemory=10193 configured in slurm.conf.
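
The comparison above can be automated. What follows is only a minimal sketch, assuming the two node lines are pasted in as strings exactly as they appear in this thread; the helper names parse_node_line and shortfalls are hypothetical and not part of Slurm. It lists every numeric resource where the hardware reports less than slurm.conf declares, which is a common reason for a node to end up drained.

```python
import re

# Numeric resource keys that appear in both lines quoted in this thread.
NUMERIC_KEYS = ("CPUs", "Boards", "SocketsPerBoard", "CoresPerSocket",
                "ThreadsPerCore", "RealMemory")


def parse_node_line(text):
    """Split a Slurm node definition into a {key: value} dict of strings.

    Hypothetical helper: it only understands simple KEY=VALUE tokens
    separated by whitespace, which is all this thread contains.
    """
    return dict(re.findall(r"(\w+)=(\S+)", text))


def shortfalls(configured, detected, keys=NUMERIC_KEYS):
    """Return {key: (configured, detected)} where detected < configured."""
    result = {}
    for key in keys:
        if key in configured and key in detected:
            conf_val, det_val = int(configured[key]), int(detected[key])
            if det_val < conf_val:
                result[key] = (conf_val, det_val)
    return result


# Output of "slurmd -C" on the node, as quoted above.
detected = parse_node_line(
    "NodeName=node012 CPUs=4 Boards=1 SocketsPerBoard=2 CoresPerSocket=2 "
    "ThreadsPerCore=1 RealMemory=6097"
)

# The slurm.conf entry for the same node, as quoted above.
configured = parse_node_line(
    "NodeName=node012 CPUs=4 Boards=1 SocketsPerBoard=2 CoresPerSocket=2 "
    "ThreadsPerCore=1 RealMemory=10193  State=UNKNOWN"
)

for key, (conf_val, det_val) in shortfalls(configured, detected).items():
    print(f"{key}: slurm.conf says {conf_val}, slurmd -C reports {det_val}")
```

With the values from this thread, only RealMemory is flagged. The CPU, socket, core and thread counts match.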
Thanks again.
Roger
    
    