[slurm-users] epilog when job is killed for max time
Noam Bernstein
noam.bernstein at nrl.navy.mil
Thu Nov 8 06:14:33 MST 2018
Thanks - that's an awesome, yet horrible, hack :)
Noam
> On Nov 8, 2018, at 3:26 AM, Josep Manel Andrés Moscardó <josep.moscardo at embl.de> wrote:
>
> Hi,
> Somebody else gave me this piece of code (I hope he doesn't mind me sharing it :) , at least it is how they do it:
>
> #!/bin/bash
>
> #SBATCH --signal=B:USR1 at 300 #<------ This will make Slurm send signal USR1 to the bash process 300 seconds before the time limit
> #SBATCH -t 00:06:00
>
> resubmit() {
> echo "It's time to resubmit"; # <----- Run whatever is necessary. Shutdown the DDBB, resubmit the job using the checkpointing files...
> }
>
> trap "resubmit" USR1
>
> sleep 1000 & # <----- Your program goes here. It's important to run on the background otherwise bash will not process the signal until this command finishes
>
> wait # <---- wait until all the background processes are finished. If a signal is received this will stop, process the signal and finish the script.
>
>
> On 7/11/18 21:16, Noam Bernstein wrote:
>> Hi slurm users - I’ve been looking through the slurm prolog/epilog manuals, but haven’t been able to figure out if there’s a way to get an epilog script (requested by the user) to run when a job is killed for running out of time, and have the epilog script be able to detect that (through an env variable, for example). Is this possible?
>> thanks,
>> Noam
>
> --
> Josep Manel Andrés Moscardó
> Systems Engineer, IT Operations
> EMBL Heidelberg
> T +49 6221 387-8394
>
____________
||
|U.S. NAVAL|
|_RESEARCH_|
LABORATORY
Noam Bernstein, Ph.D.
Center for Materials Physics and Technology
U.S. Naval Research Laboratory
T +1 202 404 8628 F +1 202 404 7546
https://www.nrl.navy.mil <https://www.nrl.navy.mil/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20181108/054edaf5/attachment.html>
More information about the slurm-users
mailing list