[slurm-users] Multifactor priority configuration
Killian Murphy
killian.murphy at york.ac.uk
Thu Jan 23 07:39:27 UTC 2020
Thanks for the replies - they will help me think about configuration
changes.
On Wed, 22 Jan 2020 at 14:05, Loris Bennett <loris.bennett at fu-berlin.de>
wrote:
> Hi,
>
> We have
>
> PriorityDecayHalfLife=7-0
> PriorityMaxAge=7-0
>
> which are the defaults.
>
> I don't quite understand the point below about the business of the
> cluster. If the cluster isn't busy, the jobs won't need to compete and
> jobs belonging to users with zero shares will still start.
>
> For me the half-life should be related to the maximum time limit. If I
> allow jobs to run for, say, 14 days, I probably want that CPU-usage to
> count against the priority for a similar period, rather than decaying
> very rapidly.
>
> Ultimately the longer the half-life, the "fairer" the priorities will
> be. However, if I have only a few serious multicore power-users, I
> might want them to have a bit of an edge over hundreds of individual
> users with small numbers of single-core jobs. In that case I would
> shorten the half-life.
>
> What a good value for PriorityMaxAge is is, to my mind, even harder to
> say. The longer it is, the more you reward the time spent pending. In
> my setup it mainly helps jobs of owners who have used up all their
> shares. As well as getting back shares through the decay of CPU-usage,
> which benefits all jobs, ageing benefits individual jobs. Of course,
> how much the jobs benefit depends greatly on the weight you give to
> MaxAge.
>
> Just my 2¢
>
> Loris
>
> Hadrian Djohari <hxd58 at case.edu> writes:
>
> > Hi Killian,
> >
> > We choose to penalize the users a little only for their previous busy
> jobs, so we choose short turnarounds.
> > PriorityDecayHalfLife=1-0
> > PriorityMaxAge=4-0
> >
> > The busier the cluster, the longer should the parameters be, so the user
> previous jobs will restrict the "future" ones more.
> > These should be adjusted based on the actual usage and impact to the
> users.
> >
> > Best,
> > Hadrian
> >
> > On Wed, Jan 22, 2020 at 4:22 AM Killian Murphy <
> killian.murphy at york.ac.uk> wrote:
> >
> > Hi all.
> >
> > I’m interested to learn what people are using for the following
> configuration items:
> >
> > * PriorityDecayHalfLife
> > * PriorityMaxAge
> >
> > and why they have chosen to set these as they have. I believe we
> haven’t got these set quite right on our cluster (3-0 for both items), and
> some understanding of what other people are doing with these
> > settings might help us to get this right!
> >
> > For context, ours is a tier 3 cluster servicing mixed workloads.
> >
> > Thanks.
> >
> > Killian
> >
> > --
> > Killian Murphy
> > Research and High Performance Computing Team Leader
> > Research Software Engineer
> >
> > Information Services & Wolfson Atmospheric Chemistry Laboratories
> > University of York
> > Heslington
> > York
> > YO10 5DD
> > +44 (0)1904 32 4753
> >
> > e-mail disclaimer: http://www.york.ac.uk/docs/disclaimer/email.htm
> --
> Dr. Loris Bennett (Mr.)
> ZEDAT, Freie Universität Berlin Email loris.bennett at fu-berlin.de
>
>
--
Killian Murphy
Research and High Performance Computing Team Leader
Research Software Engineer
Information Services & Wolfson Atmospheric Chemistry Laboratories
University of York
Heslington
York
YO10 5DD
+44 (0)1904 32 4753
e-mail disclaimer: http://www.york.ac.uk/docs/disclaimer/email.htm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200123/e9fe3e01/attachment-0001.htm>
More information about the slurm-users
mailing list