[slurm-users] How to debug a prolog script?
Bjørn-Helge Mevik
b.h.mevik at usit.uio.no
Mon Sep 19 05:50:05 UTC 2022
Davide DelVento <davide.quantum at gmail.com> writes:
>> I'm curious: What kind of disruption did it cause for your production
>> jobs?
>
> All jobs failed and went in pending/held with "launch failed requeued
> held" status, all nodes where the jobs were scheduled went draining.
>
> The logs only said "error: validate_node_specs: Prolog or job env
> setup failure on node xxxx, draining the node". I guess if they said
> "-bash: /path/to/prolog: Permission denied" I would have caught the
> problem myself.
But that is not a problem caused by having things like
exec &> /root/prolog_slurmd.$$
in the script, as you indicated. It is a problem caused by the prolog
script file not being executable.
> In hindsight it is obvious, but I don't think even the documentation
> mentions that, does it? After all you can execute a file with a
> non-executable with with "sh filename", so I made the incorrect
> assumption that slurm would have invoked the prolog that way.
Slurm prologs can be written in any language - we used to have perl
prolog scripts. :)
--
Regards,
Bjørn-Helge Mevik, dr. scient,
Department for Research Computing, University of Oslo
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 832 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20220919/1966bd0e/attachment.sig>
More information about the slurm-users
mailing list