<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>You may want to look at your resources. If the memory allocation
adds up such that there isn't enough left for any job to run, it
won't matter that there are still GPUs available.</p>
<p>Similar for any other resource (CPUs, cores, etc)</p>
<p>Brian Andrus</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 8/10/2021 8:07 AM, Jack Chen wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CADUsV6h+ekLdWTUByrj5P_Aof038fMfLQxV9m8f-g73PXgMu+w@mail.gmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div dir="ltr">
<div dir="ltr">Does anyone have any ideas on this?</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Fri, Aug 6, 2021 at 2:52 PM
Jack Chen <<a href="mailto:scsvip@gmail.com"
moz-do-not-send="true">scsvip@gmail.com</a>> wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div dir="ltr">I'm using slurm15.08.11, when I submit several
1 gpu jobs, slurm doesn't allocate nodes using compact
strategy. Anyone know how to solve this? Will upgrading
slurm latest version helpĀ ? <br>
<br>
For example, there are two nodes A and B with 8 gpus per
node, I submitted 8 1 gpu jobs, slurm will allocate first 6
jobs on node A, then last 2 jobs on node B. Then when I
submit one job with 8 gpus, it will pending because of gpu
fragments: nodes A has 2 idle gpus, node b 6 idle gpus<br>
<div><br>
</div>
<div>Thanks in advance!</div>
</div>
</blockquote>
</div>
</blockquote>
</body>
</html>