[slurm-users] correlating 'sinfo -o %C' numbers with 'scontrol show nodes' numbers?

Alexis Huxley alexis.huxley at mpcdf.mpg.de
Fri May 18 11:46:14 MDT 2018


I'm running the commands:

    sinfo -o %C

and

    scontrol --oneliner show nodes

and trying to sum up numbers in the second command to make the numbers
in the first command. I'm doing this mainly for my understanding of what
each number in the first command *really* means.

The easy ones are:

total CPUs:     in the first command it's the 4th number
                in the second command it's the sum of all the 'CPUTot' values

allocated CPUs: in the first command it's the 1st number
                in the second command it's the sum of all the 'CPUAlloc' values

But the other two are not so easy:

other CPUs:     in the first command it's the 3rd number
                in the second command it's *almost* the sum of the 'CPUTot' values
                    for all nodes in state DOWN, DOWN* or IDLE+DRAIN

idle CPUs:      in the first command it's the 2nd number
                in the second command it's *almost* the sum of the 'CPUTot' values
                    for all nodes in state IDLE

The cluster I'm looking at also has some nodes in state MIXED and I'm
wondering if, if I could break their CPUTots down then I'd get all
my sums to balance.

So my questions are:

1) would adding the broken down MIXED CPU counts make the sums balance? 

2a) if no, then what am I missing?

2b) if yes, then how can I break down the MIXED CPU counts, preferably using
   'scontrol show nodes' with some extra options?

I'm not sure if this is relevant, but I have some hidden partitions and
some nodes that are (temporarily) not allocated to any partition. I mention
this because I see 'scontrol' has a '--hide' option.

Also, we have 'Shared=EXCLUSIVE' for all partitions, which should make 
any formulae simpler, but I'd be interested in more general formulae.

Advice sought, many thanks!

Alexis



More information about the slurm-users mailing list