<html>

  <head>

    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

  </head>

  <body>

    <p>JC,</p>

    <p>One thing you will start finding in HPC is that, by it's goal,

      hyperthreading is usually a poor fit.</p>

    <p>If you are properly utilizing your cores, your jobs will actually

      be slowed by using hyperthreading.  They are not 'extra' cores,

      but a method of swapping a core to a different workload during an

      idle cycle. The operative word being 'idle'. The goal of HPC is

      get resource usage as close to 100% as possible, so there should

      be no 'idle' cycles. <br>

    </p>

    <p>From Intel's own info:</p>

    <p><span style="color: rgb(85, 85, 85); font-family: intel-clear,

        tahoma, Helvetica, helvetica, Arial, sans-serif; font-size:

        16px; font-style: normal; font-variant-ligatures: normal;

        font-variant-caps: normal; font-weight: 400; letter-spacing:

        normal; orphans: 2; text-align: start; text-indent: 0px;

        text-transform: none; white-space: normal; widows: 2;

        word-spacing: 0px; -webkit-text-stroke-width: 0px;

        background-color: rgb(243, 243, 243); text-decoration-style:

        initial; text-decoration-color: initial; display: inline

        !important; float: none;">    With CPU Hyper-Threading, a PC can

        process more information in less time and run more background

        tasks without disruption.<span> <br>

        </span></span></p>

    <p>In HPC, we have no background tasks (excluding daemons and such

      on the node), so you have little to nothing that can be improved.

      You end up allocating some of the various Core memory to secondary

      tasks instead of allowing more of it to be used by your primary

      task, leading to more fetches, wasting effort.</p>

    <p>This is a VERY simplistic description, but the point is that

      hyperthreading is not a silver bullet that will improve HPC

      performance if you are maximizing your resource utilization.</p>

    <p>Ok, I will get off my soapbox :)</p>

    <p>Brian Andrus</p>

    <p><br>

    </p>

    <div class="moz-cite-prefix">On 11/4/2020 7:30 AM, Jean-Christophe

      HAESSIG wrote:<br>

    </div>

    <blockquote type="cite"

      cite="mid:400c69ba8ecc0916fa16c682f1157e15734a41ff.camel@igbmc.fr">

      <pre class="moz-quote-pre" wrap="">Hi,

I would like to make good use of hyperthreaded processors and I already

skimmed through a quantity of posts and documentation.

It is pretty clear that Slurm likes to allocate processing units up to

the core level, and to be able to allocate threads one has to either :

- not declare Sockets/Core/Threads at all, only CPUS and use

SelectTypeParameters=CR_CPU_Memory

- resort to some trickery to declare threads as cores

However, the two options have shortcomings :

By not declaring S/C/T, options like --ntasks-per-core seem to get

ignored, but some programs would benefit from being able to be isolated

on their cores.

Not declaring S/C/T (e.g. C=C*T;T=1) as they are in reality also seems

to mess up CPU binding since Slurm does not have a real view on the

node's topology.

When S/C/T are properly declared (with SelectTypeParameters=

CR_Core_Memory), I run into other problems since the allocator thinks I

want 2 CPUS even if I asked for --ntasks=1. In combination with --mem-

per-cpu, this leads to twice memory allocated as was asked for (which

resembles bug #3879).

Finally, while the documentation explains in several places that

processing units should be allocated best by core, I miss an

explanation on why.

Any other options/advice ? Feel free to correct me if I didn't get

something right.

Thanks,

J.C. Haessig

</pre>

    </blockquote>

  </body>

</html>