[slurm-users] Aborting a job from inside the prolog

Alexander Grund alexander.grund at tu-dresden.de
Tue Jun 20 08:55:57 UTC 2023


On 19.06.23 at 17:32, Gerhard Strangar wrote:
> Try to exit with 0, because it's not your prolog that failed.

That seems to work.
I do see value in exiting with 1, though: draining the node makes it 
possible to investigate what exactly failed and why.

Although it may be better not to drain the node, I'm a bit nervous about 
"exit 0", because it is essential that the job does not start or 
continue, i.e. that the user code (sbatch script/srun) is never executed 
in that case.
So I want to be sure that running `scancel` on the job from within its 
prolog always prevents the job from running.
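
To make the trade-off concrete, here is a minimal sketch of how a prolog 
could combine both ideas: cancel the job, and only report success if the 
cancel itself succeeded. `cancel_current_job` and `SCANCEL_CMD` are 
hypothetical names made up for this illustration; only `scancel` and the 
`SLURM_JOB_ID` environment variable come from Slurm. It does not answer 
whether `scancel` followed by "exit 0" always prevents the job from 
starting, so it errs on the side of a non-zero exit whenever something 
looks wrong.

```shell
#!/bin/bash
# Sketch of a prolog helper, not a tested production script.
# SCANCEL_CMD is a hypothetical override so the logic can be dry-run
# without a Slurm installation; it defaults to the real scancel.
SCANCEL_CMD="${SCANCEL_CMD:-scancel}"

# cancel_current_job DRAIN
#   Cancels the job named by SLURM_JOB_ID (set in the prolog environment)
#   and returns the status the prolog should exit with:
#   0 keeps the node in service, 1 makes the prolog fail.
cancel_current_job() {
    drain="$1"
    if [ -z "${SLURM_JOB_ID:-}" ]; then
        echo "prolog: SLURM_JOB_ID is not set" >&2
        return 1
    fi
    if ! "$SCANCEL_CMD" "$SLURM_JOB_ID"; then
        # The cancel itself failed, so fall back to failing the prolog:
        # by default Slurm then drains the node and requeues the job held.
        echo "prolog: scancel failed for job $SLURM_JOB_ID" >&2
        return 1
    fi
    if [ "$drain" = "yes" ]; then
        # Deliberately fail the prolog so the node is drained for inspection.
        return 1
    fi
    return 0
}
```

A prolog using this would end with something like 
`cancel_current_job no; exit $?` once its own site-specific check has 
decided that the job must not run. Passing `yes` instead trades node 
availability for the chance to inspect the node afterwards.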
