<div dir="ltr"><div class="gmail_default" style="font-family:verdana,sans-serif;color:#000000">It is a university "supercomputer", not a national facility, so the admins there are not that expert, which is why I am asking here. I am fairly certain that it is some form of communication issue, but beyond that the cause is not clear.</div><div class="gmail_default" style="font-family:verdana,sans-serif;color:#000000"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif;color:#000000">If I get suggestions such as "why don't they look for ABC in XYZ", then I may be able to persuade them to look at specifics. They will need the coaching, alas.</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Dec 20, 2023 at 1:25 PM Gerhard Strangar &lt;<a href="mailto:g.s@arcor.de">g.s@arcor.de</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Laurence Marks wrote:<br>
<br>
> After some (irreproducible) time, often one of the three slow tasks hangs.<br>
> A symptom is that if I try and ssh into the main node of the subtask (which<br>
> is running 128 mpi on the 4 nodes) I get "Authentication failed".<br>
<br>
How about asking an admin to check why it hangs?<br>
<br>
</blockquote></div><br clear="all"><div><br></div><span class="gmail_signature_prefix">-- </span><br><div dir="ltr" class="gmail_signature"><div dir="ltr">Emeritus Professor Laurence Marks (Laurie)<div>Northwestern University<br><div><a href="http://www.numis.northwestern.edu" target="_blank">Webpage</a> and <a href="http://scholar.google.com/citations?user=zmHhI9gAAAAJ&hl=en" target="_blank">Google Scholar link</a></div><div>"Research is to see what everybody else has seen, and to think what nobody else has thought", Albert Szent-Györgyi</div></div></div></div>