[slurm-users] Partition Hold/Release
Nicolas Sonoda
nicolas.sonoda at versatushpc.com.br
Wed Mar 15 18:12:23 UTC 2023
Hi Marcus and Kevin,
I'm sorry, I forgot to set DefMemPerCPU on my partitions so my preempt was not working.
But after I set, the preempt worked fine and my jobs with low priority can suspend.
Thank you very much!
Regards,
Nícolas
________________________________
De: slurm-users <slurm-users-bounces at lists.schedmd.com> em nome de Kevin Broch <kbroch at rivosinc.com>
Enviado: quarta-feira, 15 de março de 2023 12:51
Para: Slurm User Community List <slurm-users at lists.schedmd.com>
Assunto: Re: [slurm-users] Partition Hold/Release
Nicolas,
It looks like for the partition named "test" you still have PreemptMode=off ?
On Wed, Mar 15, 2023 at 7:35 AM Wagner, Marcus <wagner at itc.rwth-aachen.de<mailto:wagner at itc.rwth-aachen.de>> wrote:
Hi Nicolas,
sorry to say, but we have no experience with preemption.
Best
Marcus
Am 14.03.2023 um 22:07 schrieb Nicolas Sonoda:
Hi Marcus,
Thank you very much for the response.
I set the PriorityTier for my partitions and also set PreemptType=preempt/partition_prio and PreemptMode=SUSPEND,GANG. But the job in the low priority partition does not change it state to SUSPEND. Have any idea?
Following are some information:
slurm.conf:
PreemptType=preempt/partition_prio
PreemptMode=SUSPEND,GANG
NodeName=n[24] Sockets=2 CoresPerSocket=24 ThreadsPerCore=1 RealMemory=185000 State=UNKNOWN
PartitionName=test Nodes=n[24] Default=NO MaxTime=INFINITE State=UP PriorityTier=300 PriorityJobFactor=100 OverSubscribe=FORCE PreemptMode=off AllowGroups=teste
PartitionName=test2 Nodes=n[24] Default=NO MaxTime=INFINITE State=UP PriorityTier=20 PriorityJobFactor=1 OverSubscribe=FORCE PreemptMode=suspend,gang AllowGroups=teste
$ squeue -u vhpc
JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON)
46003 test test vhpc PD 0:00 1 (Resources)
46002 test2 test2 vhpc R 0:05 1 n24
Thank you,
Nícolas
________________________________
De: slurm-users em nome de Wagner, Marcus
Enviadas: Terça-feira, 14 de Março de 2023 07:25
Para: slurm-users at lists.schedmd.com<mailto:slurm-users at lists.schedmd.com>
Assunto: Re: [slurm-users] Partition Hold/Release
Hi Nicolas,
you could use the prioritytier for partitions:
PriorityTier
Jobs submitted to a partition with a higher PriorityTier value will be evaluated by the scheduler before pending jobs in a partition with a lower PriorityTier value. They will
also be considered for preemption of running jobs in partition(s) with lower PriorityTier values if PreemptType=preempt/partition_prio. The value may not exceed 65533. Also
see PriorityJobFactor.
Best
Marcus
Am 06.03.2023 um 19:33 schrieb Nicolas Sonoda:
Hi!
Can I create a partition with a capacity of hold and release jobs when another partition jobs is submited? For example, the partition one and two can hold their jobs when some job of partition three is submited, and after this job completes the partition one and two releases their jobs again.
Thank you.
Nícolas
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20230315/0aa7edfd/attachment-0001.htm>
More information about the slurm-users
mailing list