[slurm-users] Compact scheduling strategy for small GPU jobs

Diego Zuccato diego.zuccato at unibo.it
Fri Aug 6 10:30:26 UTC 2021


Hi.

Maybe your jobs are requesting more RAM (or other resources) that after 
6 other jobs are no longer available on first node?

Try checking with scontrol show node .

BYtE,
  Diego

Il 06/08/2021 08:52, Jack Chen ha scritto:
> I'm using slurm15.08.11, when I submit several 1 gpu jobs, slurm doesn't 
> allocate nodes using compact strategy. Anyone know how to solve this? 
> Will upgrading slurm latest version help ?
> 
> For example, there are two nodes A and B with 8 gpus per node, I 
> submitted 8 1 gpu jobs, slurm will allocate first 6 jobs on node A, then 
> last 2 jobs on node B. Then when I submit one job with 8 gpus, it will 
> pending because of gpu fragments: nodes A has 2 idle gpus, node b 6 idle 
> gpus
> 
> Thanks in advance!

-- 
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786



More information about the slurm-users mailing list