[slurm-users] backfill on overlapping partitions problem
Andrej Filipcic
andrej.filipcic at ijs.si
Tue Oct 26 14:41:32 UTC 2021
Hi,
We have a strange problem with backfilling, there are
large partition "cpu" and overlapping partition "largemem" which is a
subset of "cpu" nodes.
Now, user A is submitting low priority jobs to "cpu", user B high
priority jobs to "largemem"
If there are queued jobs in "largemem" (draining nodes there), the
slurmctld would never backfill the "cpu". At the extreme,
non-overlapping "cpu" nodes would get empty until higher prio jobs get
all running in "largemem"
Any hint or workaround here? backfill works quite fine if all the jobs
are submitted to "cpu" partition. User A has typically smaller and
shorter jobs, good for backfilling.
we use these settings with slurm:
PriorityType=priority/multifactor
SchedulerType=sched/backfill
SelectType=select/cons_tres
SelectTypeParameters=CR_CORE_MEMORY,CR_CORE_DEFAULT_DIST_BLOCK
SchedulerParameters =
bf_max_job_test=2000,bf_window=1440,default_queue_depth=1000,bf_continue
Best regards,
Andrej
--
_____________________________________________________________
prof. dr. Andrej Filipcic, E-mail: Andrej.Filipcic at ijs.si
Department of Experimental High Energy Physics - F9
Jozef Stefan Institute, Jamova 39, P.o.Box 3000
SI-1001 Ljubljana, Slovenia
Tel.: +386-1-477-3674 Fax: +386-1-425-7074
-------------------------------------------------------------
More information about the slurm-users
mailing list