<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
Hi Alan,<br>
<br>
we are also seeing this, but that has nothing to do with X11
support, since we compile atm. SLURM without X11 support.<br>
We also see sometimes jobs running on, even if e.g. mpi rank one got
killed by oom, rank zero is stuck in mpi_finalize.<br>
SLURM seems to not detect everytimes, if oom killer was active, thus
not terminating the rest of the mpi-processes.<br>
<br>
Best Marcus<br>
<br>
<div class="moz-cite-prefix">On 5/16/19 9:04 AM, Alan Orth wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CAKKdN4Vh0N3Sz2iO4mroWiSdUHGmp7EoUorosdzLy0L1i5CWVQ@mail.gmail.com">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<div dir="ltr">
<div dir="ltr">Yes I'm also looking forward to SLURM 19.05. We
have had lots of issues with X11 since we upgraded to 18.08
and started using its built-in X11 support. Part of this was
resolved by setting "X11Parameters=local_xauthority" in
slurm.conf to reduce locking contention on the Xauthority
file, but now we get a handful of nodes drained every day with
reason "Kill task failed". In ten years of using SLURM I've
never had so many problems as I'm having now. :\</div>
<div dir="ltr"><br>
</div>
<div>Regards,<br>
</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Wed, May 15, 2019 at 9:40
PM Christopher Samuel <<a href="mailto:chris@csamuel.org"
moz-do-not-send="true">chris@csamuel.org</a>> wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On
5/15/19 11:36 AM, Mahmood Naderan wrote:<br>
<br>
> I really like to know why x11 is not so friendly? For
example, slurm <br>
> works with MPI. Why not with X11?!<br>
<br>
Because MPI support is fundamental, X11 support is nice to
have.<br>
<br>
I suspect 19.05 will make your life an awful lot easier!<br>
<br>
All the best,<br>
Chris<br>
-- <br>
Chris Samuel : <a href="http://www.csamuel.org/"
rel="noreferrer" target="_blank" moz-do-not-send="true">http://www.csamuel.org/</a>
: Berkeley, CA, USA<br>
<br>
</blockquote>
</div>
<br clear="all">
<br>
-- <br>
<div dir="ltr" class="gmail_signature">Alan Orth<br>
<a href="mailto:alan.orth@gmail.com" target="_blank"
moz-do-not-send="true">alan.orth@gmail.com</a><br>
<a href="https://picturingjordan.com" target="_blank"
moz-do-not-send="true">https://picturingjordan.com</a><br>
<a href="https://englishbulgaria.net" target="_blank"
moz-do-not-send="true">https://englishbulgaria.net</a><br>
<a href="https://mjanja.ch" target="_blank"
moz-do-not-send="true">https://mjanja.ch</a><br>
"In heaven all the interesting people are missing." ―Friedrich
Nietzsche</div>
</blockquote>
<br>
<pre class="moz-signature" cols="72">--
Marcus Wagner, Dipl.-Inf.
IT Center
Abteilung: Systeme und Betrieb
RWTH Aachen University
Seffenter Weg 23
52074 Aachen
Tel: +49 241 80-24383
Fax: +49 241 80-624383
<a class="moz-txt-link-abbreviated" href="mailto:wagner@itc.rwth-aachen.de">wagner@itc.rwth-aachen.de</a>
<a class="moz-txt-link-abbreviated" href="http://www.itc.rwth-aachen.de">www.itc.rwth-aachen.de</a>
</pre>
</body>
</html>