[slurm-users] Requirement to run longer jobs
Thomas M. Payerle
payerle at umd.edu
Wed Jul 3 19:05:20 UTC 2019
The dual QoSes (or the dual-partition solution suggested by someone else)
should both work to allow select users to submit jobs with longer run
times. We use something like that on our cluster (though I confess it was
our first Slurm cluster and we might have overdone it with QoSes, causing
the scheduler to work harder). But for the simple case you have, the only
downside I see is the potential extra work in creating user associations,
etc., which is not a problem if scripted.
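As a rough illustration of what "scripted" could look like, here is a minimal Python sketch that only builds the sacctmgr command lines for the two-QoS scheme and prints them for review; it does not run anything. The user names (alice, bob) and the helper names slurm_time and long_qos_commands are hypothetical, and the QoS names "normal" and "long" are taken from the proposal quoted below. It assumes QoS limits are actually enforced on the cluster (AccountingStorageEnforce in slurm.conf including limits), and it does not cover raising the partition's own MaxTime.

```python
import shlex


def slurm_time(days):
    """Format a duration in days as Slurm's D-HH:MM:SS time string."""
    total_minutes = round(days * 24 * 60)
    d, rem = divmod(total_minutes, 24 * 60)
    h, m = divmod(rem, 60)
    return f"{d}-{h:02d}:{m:02d}:00"


def long_qos_commands(users, normal_days=2.5, long_days=5):
    """Return sacctmgr command lines for the two-QoS scheme (nothing is executed).

    users: hypothetical list of user names allowed to use the "long" QoS.
    """
    argvs = [
        # Create the "long" QoS and give it the longer wall-time limit.
        ["sacctmgr", "-i", "add", "qos", "long"],
        ["sacctmgr", "-i", "modify", "qos", "long", "set",
         f"MaxWall={slurm_time(long_days)}"],
        # Cap the default "normal" QoS at the current limit.
        ["sacctmgr", "-i", "modify", "qos", "normal", "set",
         f"MaxWall={slurm_time(normal_days)}"],
    ]
    for user in users:
        # Add "long" to the QoS list of each authorized user's associations.
        argvs.append(["sacctmgr", "-i", "modify", "user", "where",
                      f"name={user}", "set", "qos+=long"])
    return [shlex.join(argv) for argv in argvs]


if __name__ == "__main__":
    # Print the commands so an admin can inspect them before running any.
    for line in long_qos_commands(["alice", "bob"]):
        print(line)
```

An authorized user would then request the longer limit at submission time, for example with sbatch --qos=long --time=5-00:00:00.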
I am not sure if either would work for extending the run time of running jobs,
though I expect it might be possible with the QoS approach. (I think it is far
more likely that one can change the QoS of a running job than its partition,
even if both partitions consist of the same set of nodes.) Also not sure
if a user can do that or if it would require sysadmin involvement.
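For the running-job case, a sketch of the scontrol commands an administrator might try is below. Again it only builds the command strings; the function name extend_job_commands and the job ID are hypothetical. The scontrol man page states that only a Slurm administrator or root can increase a job's TimeLimit, so the second command at least needs sysadmin involvement. Whether changing the QOS of an already running job succeeds is an assumption here, not something verified.

```python
import shlex


def extend_job_commands(job_id, time_limit="5-00:00:00", qos=None):
    """Return scontrol command lines to extend a job's time limit (nothing is executed).

    job_id: hypothetical numeric Slurm job ID.
    qos: optional QoS to move the job to first (untested for running jobs).
    """
    if job_id <= 0:
        raise ValueError("job_id must be a positive integer")
    cmds = []
    if qos is not None:
        # Assumption: the job's QoS is switched before raising the limit.
        cmds.append(shlex.join(
            ["scontrol", "update", f"JobId={job_id}", f"QOS={qos}"]))
    # Increasing TimeLimit requires administrator privileges.
    cmds.append(shlex.join(
        ["scontrol", "update", f"JobId={job_id}", f"TimeLimit={time_limit}"]))
    return cmds


if __name__ == "__main__":
    for line in extend_job_commands(12345, qos="long"):
        print(line)
```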
On Wed, Jul 3, 2019 at 11:52 AM David Baker <D.J.Baker at soton.ac.uk> wrote:
> A few of our users have asked about running longer jobs on our cluster.
> Currently our main/default compute partition has a time limit of 2.5 days.
> Potentially, a handful of users need jobs to run up to 5 days. Rather than
> allow all users/jobs to have a run time limit of 5 days I wondered if the
> following scheme makes sense...
> Increase the max run time on the default partition to be 5 days, however
> limit most users to a max of 2.5 days using the default "normal" QOS.
> Create a QOS called "long" with a max time limit of 5 days. Limit the users
> who can use "long". For authorized users, assign the "long" QOS to their jobs
> on the basis of their run time request.
> Does the above make sense or is it too complicated? If the above works,
> could users limited to the normal QOS have the run time of their running
> jobs increased to 5 days in exceptional circumstances?
> I would be interested in your thoughts, please.
> Best regards,
DIT-ACIGS/Mid-Atlantic Crossroads payerle at umd.edu
5825 University Research Park (301) 405-6135
University of Maryland
College Park, MD 20740-3831