[slurm-users] Aborting a job from inside the prolog
alexander.grund at tu-dresden.de
Tue Jun 20 08:55:57 UTC 2023
Am 19.06.23 um 17:32 schrieb Gerhard Strangar:
> Try to exit with 0, because it's not your prolog that failed.
That seemingly works.
I do see a value in exiting with 1 to drain the node to investigate
why/what has exactly failed.
Although it may be better to not drain it, I'm a bit nervous with "exit
0" as it is very important that the job does not start/continue, i.e.
the user code (sbatch script/srun) is never executed in that case.
So I want to be sure that an `scancel` on the job in its prolog is
actually always preventing the job from running.
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 5782 bytes
Desc: S/MIME Cryptographic Signature
More information about the slurm-users