[slurm-users] TotalCPU: sacct reporting inexplicable high values

nico.faerber at id.unibe.ch nico.faerber at id.unibe.ch
Fri Feb 1 15:43:38 UTC 2019


Hi


While doing some statistics on efficient CPU usage, I realized that sacct is reporting inexplicable (at least for me) high values for TotalCPU, UserCPU and SystemCPU. Here is a simple example (each job step is a infinite while loop):


sacct -j 64338003 --format=jobid,elapsed,ncpus,cputime,totalcpu,usercpu,systemcpu,nodelist

       JobID    Elapsed      NCPUS    CPUTime   TotalCPU    UserCPU  SystemCPU        NodeList

------------ ---------- ---------- ---------- ---------- ---------- ---------- ---------------

64338003       00:02:29           4      00:09:56    13:19:41     13:19:36    00:05.054          anode033

64338003.ba+   00:02:31        4      00:10:04    00:09.017    00:04.003  00:05.014          anode033

64338003.ex+   00:02:30        4      00:10:00    00:00.001    00:00:00    00:00.001          anode033

64338003.0     00:02:32          1      00:02:32    03:19:52     03:19:52    00:00.013          anode033

64338003.1     00:02:32          1      00:02:32    03:19:54     03:19:54    00:00.008          anode033

64338003.2     00:02:32          1      00:02:32    03:19:53     03:19:53    00:00.010          anode033

64338003.3     00:02:32          1      00:02:32    03:19:52     03:19:52    00:00.007          anode033


I would expect CPUTime to be the upper limit for TotalCPU.


Looking at cpuacct.stat for job step3:


cat /cgroup/cpuacct/slurm/uid_6994/job_64338003/step_3/cpuacct.stat

user 14902       (~149 = 00:02:29)

system 0


This value corresponds to the expected CPU usage of a single job step.


We are running Slurm 18.08.4 with

JobAcctGatherType=jobacct_gather/cgroup


Does anyone have an explanation for those high values reported by sacct?



Best,

Nico


Universitaet Bern
Abt. Informatikdienste

Nico Färber
High Performance Computing

Gesellschaftsstrasse 6
CH-3012 Bern
Raum 104
Tel. +41 (0)31 631 51 89
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20190201/7c785e47/attachment-0001.html>


More information about the slurm-users mailing list