[slurm-users] how to know the real utilization of a node when oversubscribe is set to FORCE (Mark Hahn)

肖正刚 guru.novice at gmail.com
Fri Jul 17 03:15:07 UTC 2020


Hi, Hahn
I mean the elapsed time.
In the example I mentioned earlier,a job run 10s ,the elapsed time from
sacct is 640s(10s*64), so how do i get the real elapsed time from sacct or
other command line tools?

As you mentioned, i checked usercpu/systemcpu/totalcpu, but they all zero
Query command: sacct -T -X -S 2020-07-16T00:00:00 -E 2020-07-16T23:59:59 -r
$partition
--format=cputimeraw,usercpu,systemcpu,totalcpu,alloccpus,allocnodes,AllocTRES%40
CPUTimeRAW    UserCPU  SystemCPU   TotalCPU  AllocCPUS AllocNodes
                      AllocTRES
---------- ---------- ---------- ---------- ---------- ----------
----------------------------------------
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1
       640   00:00:00   00:00:00   00:00:00         64          1
       billing=64,cpu=64,node=1

Jobs use 1/2/4/8/16/32/64 cores, but elapsed time are the same from sacct.
The real elasped time should be 10/20/40/80/160/320/640

regards.


>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Thu, 16 Jul 2020 11:03:07 -0400 (EDT)
> From: Mark Hahn <hahn at mcmaster.ca>
> To: Slurm User Community List <slurm-users at lists.schedmd.com>
> Subject: Re: [slurm-users] how to know the real utilization of a node
>         when oversubscribe is set to FORCE
> Message-ID:
>         <alpine.LFD.2.02.2007161036410.16842 at coffee.psychology.mcmaster.ca
> >
> Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed
>
> > srun -N 1 -n 1 -p testA sleep 10
> > then the cpurawtime of this job recorded by slurm is 640s, but actually
> > this job only used 10s;
> > so, I want to know are there any way to get the real cputime used by this
> > job in slurm.
>
> if you really mean cpu time (compute-bound, not elapsed),
> then don't you just want usercpu, systemcpu and totalcpu from sacct?
>
> cputime/cputimeraw is just ncpus * elapsed.
>
> regards,
> --
> operator may differ from spokesperson.              hahn at mcmaster.ca
>
>
>
> ------------------------------
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200717/b3775c65/attachment.htm>


More information about the slurm-users mailing list