[slurm-users] Compact scheduling strategy for small GPU jobs

Brian Andrus toomuchit at gmail.com
Tue Aug 10 15:42:58 UTC 2021


You may want to look at your resources. If the memory allocation adds up 
such that there isn't enough left for any job to run, it won't matter 
that there are still GPUs available.

Similar for any other resource (CPUs, cores, etc)

Brian Andrus


On 8/10/2021 8:07 AM, Jack Chen wrote:
> Does anyone have any ideas on this?
>
> On Fri, Aug 6, 2021 at 2:52 PM Jack Chen <scsvip at gmail.com 
> <mailto:scsvip at gmail.com>> wrote:
>
>     I'm using slurm15.08.11, when I submit several 1 gpu jobs, slurm
>     doesn't allocate nodes using compact strategy. Anyone know how to
>     solve this? Will upgrading slurm latest version help ?
>
>     For example, there are two nodes A and B with 8 gpus per node, I
>     submitted 8 1 gpu jobs, slurm will allocate first 6 jobs on node
>     A, then last 2 jobs on node B. Then when I submit one job with 8
>     gpus, it will pending because of gpu fragments: nodes A has 2 idle
>     gpus, node b 6 idle gpus
>
>     Thanks in advance!
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20210810/0f40181b/attachment.htm>


More information about the slurm-users mailing list