rug262 at psu.edu
Fri Mar 10 16:43:18 UTC 2023
>From what I'm reading in the man pages, it seems like PreempExemptTime isn't compatible with suspending jobs instead of requeue/cancelling them, no matter if at the partition or qos level. Am I reading that correctly? We currently give users the option of submitting to a partition that lets their job be suspended, or to one where they're requeued. Do we have to give that up to offer a min run time on jobs (for just one partition/qos)?
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Christopher Samuel <chris at csamuel.org>
Sent: Tuesday, March 7, 2023 3:40 PM
To: slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
Subject: Re: [slurm-users] PreemptExemptTime
On 3/7/23 6:46 am, Groner, Rob wrote:
> Over global settings are PreemptMode=SUSPEND,GANG and
> PreemptType=preempt/partition_prio. We have a high priority partition
> that nothing should ever preempt, and an open partition that is always
> preemptable. In between is a burst partition. It can be preempted if
> the high priority partition needs the resources. That's the partition
> we'd like to guarantee a 1 hour run time on. Looking at the sacctmgr
> man page, it gives this info on QOS
Just a quick comment, here you're talking about both partitions and
QOS's with respect to preemption, I think for this you need to pick just
one of those options and only use those configs. For instance we just
use QOS's for preemption and our exempt time works in that case.
Hope this helps!
All the best,
Chris Samuel : https://nam10.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.csamuel.org%2F&data=05%7C01%7Crug262%40psu.edu%7C0ed8f546e70843e7266208db1f4c5471%7C7cf48d453ddb4389a9c1c115526eb52e%7C0%7C0%7C638138184919726959%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=md1MaA%2FIj%2Fb%2B27e7JLkVwSY5IUSVnsJIdx6PADl0JF4%3D&reserved=0 : Berkeley, CA, USA
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the slurm-users