[slurm-users] How to check if there's a reservation

Prentice Bisbal pbisbal at pppl.gov
Wed Jun 13 11:37:03 MDT 2018


Just to revisit this, for jobs that are queued, but prevented from 
running, will have a more useful reason in 18.08, which will address one 
of my issues with reservation collisions.

https://bugs.schedmd.com/show_bug.cgi?id=5138

https://bugs.schedmd.com/show_bug.cgi?id=4987

Prentice Bisbal
Lead Software Engineer
Princeton Plasma Physics Laboratory
http://www.pppl.gov

On 05/11/2018 10:36 AM, Douglas Jacobsen wrote:
> A feature that many slurm users might like is sbatch --time-min.  
> Using both --time-min and --time a user can specify the range of 
> acceptable wall times limits.  This can make it much easier to keep 
> jobs running right  up to the maintenance reservation.  e.g.:
>
> sbatch --time-min=30:00 --time=48:00:00 script.sh
>
> would allow the job to schedule for any time-slot between 30 minutes 
> and 2 days in length.  If the user has some mechanism for job chaining 
> or similar, this can allow them to make the most of backfill 
> opportunities.
>
> -Doug
>
> ----
> Doug Jacobsen, Ph.D.
> NERSC Computer Systems Engineer
> National Energy Research Scientific Computing Center 
> <http://www.nersc.gov>
> dmjacobsen at lbl.gov <mailto:dmjacobsen at lbl.gov>
>
> ------------- __o
> ---------- _ '\<,_
> ----------(_)/  (_)__________________________
>
>
>
> On Fri, May 11, 2018 at 7:27 AM Paul Edmon <pedmon at cfa.harvard.edu 
> <mailto:pedmon at cfa.harvard.edu>> wrote:
>
>     In the past we used the LUA job submit plugin to block jobs that
>     would
>     intersect maintenance reservations.  I would look at that.
>
>     -Paul Edmon-
>
>
>     On 05/11/2018 08:19 AM, Bill Wichser wrote:
>     > The problem is that reservations can be in there yet have no
>     effect on
>     > the submitted job if they would run before the reservation takes
>     > place. One can pull the starting time simply using something
>     like this
>     >
>     > scontrol show res -o | awk '{print $2}'
>     >
>     > with output
>     >
>     > StartTime=2018-06-12T06:00:00
>     > StartTime=2018-06-12T06:00:00
>     >
>     > You'd need more code around that, obviously, to determine if this
>     > starttime might hold up the job.
>     >
>     > Bill
>     >
>     >
>     > On 05/10/2018 04:23 PM, Prentice Bisbal wrote:
>     >> Dear Slurm Users,
>     >>
>     >> We've started using maintenance reservations. As you would expect,
>     >> this caused some confusion for users who were wondering why their
>     >> jobs were queuing up and not running. Some of my users provide a
>     >> public service of sorts that automatically submits jobs to our
>     >> cluster. They would like to have their submission framework
>     >> automatically detect if there's a reservation that may
>     interfere with
>     >> their jobs, and act accordingly.
>     >>
>     >> What is the best way to do this? Typically, in my shell scripts, I
>     >> have some command that tests something, and then check exit code
>     >> returned by the command. For example to check if my name is in
>     file
>     >> 'foo.txt', I'd do something like this:
>     >>
>     >> grep -iq prentice foo.txt
>     >> retval=$?
>     >> if [ $retval -eq 0 ]; then
>     >>      echo "Prentice found"
>     >> else
>     >>      echo "Prentice not found"
>     >> fi
>     >> unset retval
>     >>
>     >> Or something like that. I was also thinking this might work, too:
>     >>
>     >> num_res=$(scontrol -o show res  | wc -l)
>     >> if [ $num_res -eq 0 ]; then
>     >>      echo "No reservations found"
>     >> else
>     >>      echo "$num_res reservation(s) found"
>     >> fi
>     >>
>     >> Are there any better or other ways that you would recommend?
>     Also, if
>     >> there's more than one, is are they listed in any kind of order
>     in the
>     >> scontrol or sinfo output (soonest first, soonest last, etc.)? From
>     >> the man page, it looks like 'scontrol show reservation' doesn't
>     >> provide any sorting.
>     >>
>     >> Prentice
>     >>
>     >>
>     >>
>     >>
>     >>
>     >
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180613/d667fb71/attachment-0001.html>


More information about the slurm-users mailing list