[slurm-users] Job not cancelled after "TimeLimit" supered
Ole Holm Nielsen
Ole.H.Nielsen at fysik.dtu.dk
Tue Mar 10 10:25:08 UTC 2020
On 3/10/20 9:03 AM, sysadmin.caos wrote:
> my SLURM cluster has configured a partition with a "TimeLimit" of 8 hours.
> Now, a job is running during 9h30m and it has been not cancelled. During
> these 9 hours and a half, a script has executed a "scontrol update
> partition=mypartition state=down" for disabling this partition
> (educational cluster and at 8:00 start students classes).
>
> Why my job hasn't been cancelled? There is no any log at SLURM controller
> that explains this behaviour.
You may want to check the following parameter in your slurm.conf file
(read the man-page first):
AccountingStorageEnforce: This controls what level of association-based
enforcement to impose on job submissions.
You may want to read about EnforcePartLimits and OverTimeLimit parameters
as well.
Display your current configuration by: scontrol show config
/Ole
More information about the slurm-users
mailing list