[slurm-users] missing info from sacct
Andy Riebs
andy at candooz.com
Wed Nov 18 14:42:00 UTC 2020
Hi Navin,
I can't help with the sreport problem, but I did recognize the situation
with the gap in job numbers (the use of federation), and jumped in for
that one.
Since this list is completely populated by volunteers, there is no one
"assigned" to topic areas, but people jump in where they can. (That's
one reason that it's a good idea to limit each mail thread to a single
problem.)
In any case, I hope someone can help with your sreport problem.
Andy
On 11/18/2020 9:30 AM, navin srivastava wrote:
> Thank you Andy.
>
> but when i am trying to get the utilization for the months it says it
> is 100%.
> when i tried to find it using utilization by user it gives me a very
> different value which i am unable to understand.
>
> deda1x1466:~ # sreport cluster AccountUtilizationByUser
> start=10/02/20 end=10/02/20 cluster=hpc2 -t HOUR --tres=cpu
> --------------------------------------------------------------------------------
> Cluster/Account/User Utilization 2020-10-02T00:00:00 -
> 2020-10-02T00:59:59 (3600 secs)
> Usage reported in TRES Hours
> --------------------------------------------------------------------------------
> Cluster Account Login Proper Name TRES Name
> Used
> --------- --------------- --------- --------------- --------------
> ---------
> hpc2 root cpu 68159
> hpc2 stdg_acc cpu 68159
> hpc2 stdg_acc m219018 Harbach Philipp cpu 317
> hpc2 stdg_acc m253000 Morin Valerie cpu 12
> hpc2 stdg_acc m254746 Lippolis Eleon+ cpu 9
> hpc2 stdg_acc m258464 Wurl Andreas cpu 96
> hpc2 stdg_acc m262230 Schmelzer Maxi+ cpu 2
> hpc2 stdg_acc m270962 Heidrich Johan+ cpu 67647
> hpc2 stdg_acc m271803 Hermsen Marko cpu 46
> hpc2 stdg_acc m275696 Ploetz Tobias cpu 10
> hpc2 stdg_acc m278452 Brandenburg Ja+ cpu 19
> hpc2 stdg_acc m290493 cpu 1
>
> How it is calculating the hour in a day .
>
> Regards
> Navin.
>
>
>
> On Wed, Nov 18, 2020 at 7:51 PM Andy Riebs <andy at candooz.com
> <mailto:andy at candooz.com>> wrote:
>
> I see from your subsequent post that you're using a pair of clusters
> with a single database, so yes, you are using federation.
>
> The high order bits of the Job ID identify the cluster that ran
> the job,
> so you will typically have a huge gap between ranges of Job IDs.
>
> Andy
>
> On 11/18/2020 9:15 AM, Andy Riebs wrote:
> > Are you using federated clusters? If not, check slurm.conf -- do
> you
> > have FirstJobId set?
> >
> > Andy
> >
> > On 11/18/2020 8:42 AM, navin srivastava wrote:
> >> While running the sacct we found that some jobid are not listing.
> >>
> >> 5535566 SYNTHLIBT+ stdg_defq stdg_acc 1 COMPLETED
> >> 0:0
> >> 5535567 SYNTHLIBT+ stdg_defq stdg_acc 1 COMPLETED
> >> 0:0
> >> 11016496 jupyter-s+ stdg_defq stdg_acc 1 RUNNING
> >> 0:0
> >> 11016496.ex+ extern stdg_acc 1 COMPLETED
> >> 0:0
> >>
> >> Not able to see the jobid in between these range in sacct info.
> >> Any hint what went wrong here.
> >>
> >> Regards
> >> Navin.
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20201118/69db7e83/attachment-0001.htm>
More information about the slurm-users
mailing list