[slurm-users] sacct: error

Chris Samuel chris at csamuel.org
Sat May 5 06:42:29 MDT 2018


On Saturday, 5 May 2018 2:45:19 AM AEST Eric F. Alemany wrote:

> With Ray suggestion i have a error message for each nodes. Here i am giving
> you only one error message from a node.
> sacct: error: NodeNames=radonc01 CPUs=32 doesn't match
> Sockets*CoresPerSocket*ThreadsPerCore (16), resetting CPUs
> The interesting thing is if you follow the
> Sockets*CoresPerSocket*ThreadsPerCore formula 2x8x2 = 32  however look above
> and it says (16) - Strange, no ?

No, Slurm is right.   CPUS != threads.  You've got 16 CPU cores, each with 2 
threads.  So in this configuration you can schedule 16 tasks per node and each 
task can use 2 threads.

What does "slurmd -C" say on that node?

All the best,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC




More information about the slurm-users mailing list