<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p>Restarting slurmd should be fine, assuming the daemons come back
before the communications time out. I restart slurmd daemons all the
time and haven't had any real problems.</p>
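<p>To check how long that window is before restarting, the timeouts can
be read from the running configuration. The following is a minimal
sketch, not a definitive tool: it assumes <code>scontrol show
config</code> prints lines of the form <code>SlurmdTimeout = 300
sec</code>, and it assumes <code>SlurmdTimeout</code> and
<code>MessageTimeout</code> are the settings that matter here. The
function names are hypothetical.</p>
<pre>
```python
import re
import subprocess

# Timeout settings assumed to be relevant to a slurmd restart
# (names as printed by "scontrol show config").
TIMEOUT_KEYS = ("SlurmdTimeout", "MessageTimeout")


def parse_timeouts(config_text):
    """Return {name: seconds} for the timeout keys found in the given text."""
    timeouts = {}
    for line in config_text.splitlines():
        # Assumed line format: "SlurmdTimeout           = 300 sec"
        match = re.match(r"\s*(\w+)\s*=\s*(\d+)\s*sec", line)
        if match and match.group(1) in TIMEOUT_KEYS:
            timeouts[match.group(1)] = int(match.group(2))
    return timeouts


def current_timeouts():
    """Query the running configuration; requires scontrol on PATH."""
    result = subprocess.run(
        ["scontrol", "show", "config"],
        capture_output=True, text=True, check=True,
    )
    return parse_timeouts(result.stdout)
```
</pre>
<p>Calling <code>current_timeouts()</code> on a node with Slurm
installed returns a dictionary of the timeouts in seconds. If slurmd is
back up well inside the <code>SlurmdTimeout</code> value, the restart
should fall within the window described above.</p>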
<p>-Paul Edmon-<br>
</p>
<br>
<div class="moz-cite-prefix">On 7/27/2018 6:51 PM, Chris Harwell
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CAMGOUy6JwxhY2UyehsrVrjmRwf5FC8T+GsgS_XDWxq2RoF78Gg@mail.gmail.com">
It is possible, but double-check your config for timeouts first.<br>
<br>
<div class="gmail_quote">
<div dir="ltr">On Fri, Jul 27, 2018, 15:31 Prentice Bisbal &lt;<a
href="mailto:pbisbal@pppl.gov" moz-do-not-send="true">pbisbal@pppl.gov</a>&gt;
wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">Slurm-users,<br>
<br>
I'm still learning Slurm, so I have what I think is a basic
question. <br>
Can you restart slurmd on nodes where jobs are running, or
will that <br>
kill the jobs? I ran into the same problem as described here:<br>
<br>
<a href="https://bugs.schedmd.com/show_bug.cgi?id=3535"
rel="noreferrer" target="_blank" moz-do-not-send="true">https://bugs.schedmd.com/show_bug.cgi?id=3535</a><br>
<br>
I believe the best way to fix this is to restart slurmd on all
my nodes, <br>
but I've been unable to determine conclusively whether I can
do that w/o <br>
killing running jobs. I've spent some time googling this, but
couldn't <br>
find a definitive answer one way or the other. I prefer to not
kill a <br>
bunch of user jobs on a Friday afternoon.<br>
<br>
-- <br>
Prentice<br>
<br>
<br>
</blockquote>
</div>
-- <br>
<div dir="ltr" class="gmail_signature"
data-smartmail="gmail_signature">
<div dir="ltr">Chris Harwell</div>
</div>
</blockquote>
<br>
</body>
</html>