<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" style="font-size:12pt;color:#000000;font-family:Calibri,Helvetica,sans-serif;" dir="ltr">
<p>Hi</p>
<p>I made some progress trying to understand the problem i reported some weeks ago:</p>
<p><br>
</p>
<p><a href="https://lists.schedmd.com/pipermail/slurm-users/2023-May/010027.html" class="OWAAutoLink">https://lists.schedmd.com/pipermail/slurm-users/2023-May/010027.html</a><br>
</p>
<p><br>
</p>
<p>I noticed that the intermittent connection timeout that i am experiencing occurs only</p>
<p>when using the tcp based direct connection to establish communication between stepd</p>
<p>on different nodes.</p>
<p>When disabling the optimized direct connection using</p>
<p><span><br>
</span></p>
<p><span>export SLURM_PMIX_DIRECT_CONN=false</span><br>
</p>
<p><br>
</p>
<p>the submission of hetjobs is stable and not</p>
<p>connection timeout occurs anymore.</p>
<p>Any idea what can goes wrong when using tcp based direct connection together with hetjobs?</p>
<div id="Signature">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica, sans-serif, EmojiFont, "Apple Color Emoji", "Segoe UI Emoji", NotoColorEmoji, "Segoe UI Symbol", "Android Emoji", EmojiSymbols;">
<p></p>
<div><br>
</div>
<div>Cheers,</div>
<div>Denis</div>
<div><br>
</div>
<div><span style="font-size:9pt">---------</span><span style="font-size:9pt"></span></div>
<div><span style="font-size:9pt">Denis Bertini</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Abteilung: CIT</span><br>
<span style="font-size:9pt"></span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Ort: SB3 2.265a</span></div>
<span style="font-size:9pt"></span>
<div><br>
<span style="font-size:9pt"></span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Tel: +49 6159 71 2240</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Fax: +49 6159 71 2986</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">E-Mail: d.bertini@gsi.de</span></div>
<span style="font-size:9pt"></span>
<div><br>
<span style="font-size:9pt"></span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">GSI Helmholtzzentrum für Schwerionenforschung GmbH</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Planckstraße 1, 64291 Darmstadt, Germany, www.gsi.de</span></div>
<span style="font-size:9pt"></span>
<div><br>
<span style="font-size:9pt"></span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Commercial Register / Handelsregister: Amtsgericht Darmstadt, HRB 1528</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Managing Directors / Geschäftsführung:</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Professor Dr. Paolo Giubellino, Dr. Ulrich Breuer, Jörg Blaurock</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Chairman of the GSI Supervisory Board / Vorsitzender des GSI-Aufsichtsrats:</span></div>
<span style="font-size:9pt"></span>
<div><span style="font-size:9pt">Ministerialdirigent Dr. Volkmar Dietz</span></div>
<p></p>
</div>
</div>
</div>
</body>
</html>