[slurm-users] Job not cancelled after "TimeLimit" exceeded

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Tue Mar 10 10:25:08 UTC 2020


On 3/10/20 9:03 AM, sysadmin.caos wrote:
> My SLURM cluster has a partition configured with a "TimeLimit" of 8 hours.
> A job has now been running for 9h30m and has not been cancelled. During
> those nine and a half hours, a script executed "scontrol update
> partition=mypartition state=down" to disable the partition (this is an
> educational cluster, and student classes start at 8:00).
> 
> Why hasn't my job been cancelled? There is nothing in the SLURM controller
> log that explains this behaviour.

You may want to check the following parameter in your slurm.conf file 
(read the man-page first):

AccountingStorageEnforce: This controls what level of association-based 
enforcement to impose on job submissions.

You may want to read about the EnforcePartLimits and OverTimeLimit
parameters as well.

Display your current configuration with: scontrol show config
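
Since "scontrol show config" prints a long list, it can help to filter it
down to the parameters named above. The following is only a sketch: the
helper name show_limit_settings is made up for illustration, and it assumes
the usual "Name = value" layout of the scontrol output.

```shell
# Hypothetical helper: keep only the time-limit-related settings from
# `scontrol show config` output read on stdin.
show_limit_settings() {
    grep -Ei '^(AccountingStorageEnforce|EnforcePartLimits|OverTimeLimit)[[:space:]]*='
}

# Only query the controller when scontrol is actually installed.
if command -v scontrol >/dev/null 2>&1; then
    scontrol show config | show_limit_settings
fi
```

A non-zero OverTimeLimit in that output would mean jobs are allowed to run
past their time limit by that many minutes before being cancelled.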

/Ole




