<html>
<head>
<meta http-equiv="Content-Type" content="text/html;
charset=windows-1252">
</head>
<body>
<p>Suspend is really nothing more than hitting ^S on the job, so
there is no interaction between it and the partition once it gets
running.</p>
<p>What behavior would you expect? Suspend is not cancel, which
would need to be done to get the job out of that partition (even
if it were checkpoint, then cancel to be resumed on another node).</p>
<p>Brian Andrus<br>
</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 3/24/2021 7:31 AM, Gestió Servidors
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:VI1PR0701MB2895103D7CC136F083C92EC9F2639@VI1PR0701MB2895.eurprd07.prod.outlook.com">
<meta http-equiv="Content-Type" content="text/html;
charset=windows-1252">
<meta name="Generator" content="Microsoft Word 15 (filtered
medium)">
<style>@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:11.0pt;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}span.EstiloCorreo17
{mso-style-type:personal-compose;
font-family:"Calibri",sans-serif;
color:windowtext;}.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}div.WordSection1
{page:WordSection1;}</style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
<div class="WordSection1">
<p class="MsoNormal">Hi,<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I have got this new question for you: <o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">In my cluster there is a running job. Then,
I change a partition state from “up” to “down”. Then, that job
continues “running” because it was already running before the
state had changed. Now, I run explicitly a “scontrol suspend
my_job”. After it, my job remains at the queue because of it
is suspended and, also, I have change partition status to
“down”. After 1 hour (for example), I run “scontrol resume
myjob” and, I don’t know why, job continues “running”… in a
partition than is still “down”. Why?<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Thanks<o:p></o:p></p>
</div>
</blockquote>
</body>
</html>