[slurm-users] missing info from sacct

Andy Riebs andy at candooz.com
Wed Nov 18 14:42:00 UTC 2020


Hi Navin,

I can't help with the sreport problem, but I did recognize the situation 
with the gap in job numbers (the use of federation), and jumped in for 
that one.
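
As a rough illustration of why federated job IDs leave such gaps, here is a minimal Python sketch. It assumes the layout described in the Slurm federation documentation, where the low 26 bits of the 32-bit job ID hold the local job ID and the upper 6 bits hold the ID of the originating cluster. The function name and the sample ID are made up for illustration and are not taken from this thread.

```python
# Assumed layout (per the Slurm federation guide): bits 0-25 hold the
# local job ID, bits 26-31 hold the origin cluster ID.
LOCAL_ID_BITS = 26
LOCAL_ID_MASK = (1 << LOCAL_ID_BITS) - 1


def split_federated_job_id(job_id: int) -> tuple[int, int]:
    """Return (origin_cluster_id, local_job_id) for a 32-bit job ID."""
    if not 0 <= job_id < 2**32:
        raise ValueError("job ID must fit in 32 bits")
    return job_id >> LOCAL_ID_BITS, job_id & LOCAL_ID_MASK


if __name__ == "__main__":
    # Hypothetical ID: local job 1234 submitted on the cluster with origin ID 2.
    sample = (2 << LOCAL_ID_BITS) | 1234
    print(sample, split_federated_job_id(sample))
```

Under that assumed layout, the ID ranges of clusters with consecutive origin IDs start 2^26 = 67,108,864 apart, which is where a large jump between ranges would come from.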

Since this list is completely populated by volunteers, there is no one 
"assigned" to topic areas, but people jump in where they can. (That's 
one reason that it's a good idea to limit each mail thread to a single 
problem.)

In any case, I hope someone can help with your sreport problem.

Andy

On 11/18/2020 9:30 AM, navin srivastava wrote:
> Thank you Andy.
>
> But when I try to get the utilization for the months, it reports
> 100%.
> When I try to get the utilization by user, it gives me a very
> different value, which I am unable to understand.
>
> deda1x1466:~ # sreport cluster AccountUtilizationByUser start=10/02/20 end=10/02/20 cluster=hpc2 -t HOUR --tres=cpu
> --------------------------------------------------------------------------------
> Cluster/Account/User Utilization 2020-10-02T00:00:00 - 2020-10-02T00:59:59 (3600 secs)
> Usage reported in TRES Hours
> --------------------------------------------------------------------------------
>   Cluster         Account     Login     Proper Name      TRES Name      Used
> --------- --------------- --------- --------------- -------------- ---------
>      hpc2            root                                      cpu     68159
>      hpc2        stdg_acc                                      cpu     68159
>      hpc2        stdg_acc   m219018 Harbach Philipp            cpu       317
>      hpc2        stdg_acc   m253000   Morin Valerie            cpu        12
>      hpc2        stdg_acc   m254746 Lippolis Eleon+            cpu         9
>      hpc2        stdg_acc   m258464    Wurl Andreas            cpu        96
>      hpc2        stdg_acc   m262230 Schmelzer Maxi+            cpu         2
>      hpc2        stdg_acc   m270962 Heidrich Johan+            cpu     67647
>      hpc2        stdg_acc   m271803   Hermsen Marko            cpu        46
>      hpc2        stdg_acc   m275696   Ploetz Tobias            cpu        10
>      hpc2        stdg_acc   m278452 Brandenburg Ja+            cpu        19
>      hpc2        stdg_acc   m290493                            cpu         1
>
> How is it calculating the hours in a day?
>
> Regards
> Navin.
>
>
>
> On Wed, Nov 18, 2020 at 7:51 PM Andy Riebs <andy at candooz.com> wrote:
>
>     I see from your subsequent post that you're using a pair of clusters
>     with a single database, so yes, you are using federation.
>
>     The high order bits of the Job ID identify the cluster that ran
>     the job, so you will typically have a huge gap between ranges of
>     Job IDs.
>
>     Andy
>
>     On 11/18/2020 9:15 AM, Andy Riebs wrote:
>     > Are you using federated clusters? If not, check slurm.conf -- do
>     > you have FirstJobId set?
>     >
>     > Andy
>     >
>     > On 11/18/2020 8:42 AM, navin srivastava wrote:
>     >> While running sacct, we found that some job IDs are not listed.
>     >>
>     >> 5535566      SYNTHLIBT+  stdg_defq   stdg_acc          1  COMPLETED      0:0
>     >> 5535567      SYNTHLIBT+  stdg_defq   stdg_acc          1  COMPLETED      0:0
>     >> 11016496     jupyter-s+  stdg_defq   stdg_acc          1    RUNNING      0:0
>     >> 11016496.ex+     extern              stdg_acc          1  COMPLETED      0:0
>     >>
>     >> We cannot see the job IDs between these ranges in the sacct output.
>     >> Any hint as to what went wrong here?
>     >>
>     >> Regards
>     >> Navin.
>