[slurm-users] Calculate the GPU usages

Paul Raines raines at nmr.mgh.harvard.edu
Wed Sep 1 14:02:21 UTC 2021


Doesn't add up for me either

# sacct -X -P --format=jobid,partition,ElapsedRaw,state,AllocTRES 
--starttime=2021-08-01 --endtime=2021-08-31 --user=jl1103  | grep gpu=

470069|1|FAILED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1
470351|1|FAILED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1
470352|1|FAILED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1
470359|4|FAILED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1
470370|515|CANCELLED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1
470395|1140|FAILED|billing=9,cpu=2,gres/gpu=2,mem=40G,node=2
470731|57875|FAILED|billing=4,cpu=1,gres/gpu=1,mem=20G,node=1

That is (1+1+1+4+515+2*1140+57875)/60 = 1011 minutes

# sreport cluster AccountUtilizationByUser start=2021-08-01 
end=2021-08-31 user=jl1103 --tres="gres/gpu"
--------------------------------------------------------------------------------
Cluster/Account/User Utilization 2021-08-01T00:00:00 - 2021-08-30T23:59:59 (2592000 secs)
Usage reported in TRES Minutes
--------------------------------------------------------------------------------
Cluster     Account     Login     Proper Name      TRES Name     Used
------- ----------- --------- --------------- -------------- --------
    mlsc     *******    jl1103     ***********       gres/gpu      390


In slurm.conf for the partition all these jobs ran on I have
TRESBillingWeights="CPU=1.24,Mem=0.02G,Gres/gpu=3.0"  if that effects
the sreport number somehow -- but then I would expect sreport's number
to simply be 3x the sacct number then


-- Paul Raines (http://help.nmr.mgh.harvard.edu)






More information about the slurm-users mailing list