[slurm-users] Compact scheduling strategy for small GPU jobs
diego.zuccato at unibo.it
Fri Aug 6 10:30:26 UTC 2021
Maybe your jobs are requesting RAM (or other resources) that, after
6 other jobs have been placed, is no longer available on the first node?
Try checking with scontrol show node .
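
To make the hypothesis above concrete, here is a toy first-fit model in Python. It is only a sketch and not Slurm's actual scheduling code: the Node class, the place() helper, the 64 GB of RAM per node and the 10 GB requested per job are all made-up assumptions, chosen so that memory rather than GPUs runs out first. Only the 8 GPUs per node and the eight 1-GPU jobs come from the question.

```python
# Toy first-fit model; NOT Slurm's real scheduler.
# All memory figures below are hypothetical, chosen only for illustration.
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_gpus: int
    free_mem_gb: int


def place(nodes, gpus, mem_gb):
    """Put a job on the first node with enough free GPUs and RAM.

    Returns the node name, or None if no single node can host the job.
    """
    for node in nodes:
        if node.free_gpus >= gpus and node.free_mem_gb >= mem_gb:
            node.free_gpus -= gpus
            node.free_mem_gb -= mem_gb
            return node.name
    return None


# 8 GPUs per node as in the question; 64 GB RAM per node is an assumption.
nodes = [Node("A", 8, 64), Node("B", 8, 64)]

# Eight 1-GPU jobs, each requesting 10 GB of RAM (assumed).
placements = [place(nodes, gpus=1, mem_gb=10) for _ in range(8)]
idle_gpus = {n.name: n.free_gpus for n in nodes}

# A later 8-GPU job needs all of its GPUs on one node.
big_job = place(nodes, gpus=8, mem_gb=10)

print(placements)  # six jobs on A, then two on B
print(idle_gpus)   # A keeps 2 idle GPUs, B keeps 6
print(big_job)     # None: no single node has 8 free GPUs
```

With these assumed numbers node A runs out of RAM after six jobs while two of its GPUs are still idle, which reproduces the 6 + 2 split described in the question. If the real nodes show a similar pattern, lowering the per-job memory request may let all eight jobs fit on one node.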
On 06/08/2021 08:52, Jack Chen wrote:
> I'm using Slurm 15.08.11. When I submit several 1-GPU jobs, Slurm doesn't
> allocate nodes using a compact strategy. Does anyone know how to solve this?
> Will upgrading to the latest Slurm version help?
> For example, there are two nodes, A and B, with 8 GPUs per node. I
> submitted eight 1-GPU jobs; Slurm allocated the first 6 jobs on node A, then
> the last 2 jobs on node B. Then, when I submit one job with 8 GPUs, it stays
> pending because of GPU fragmentation: node A has 2 idle GPUs, node B has 6 idle.
> Thanks in advance!
DIFA - Dip. di Fisica e Astronomia
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786