[slurm-users] FairShare
Renfro, Michael
Renfro at tntech.edu
Wed Dec 2 17:32:43 UTC 2020
Yesterday, I posted https://docs.rc.fas.harvard.edu/kb/fairshare/<https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs.rc.fas.harvard.edu%2Fkb%2Ffairshare%2F&data=04%7C01%7Crenfro%40tntech.edu%7Cc23f89dcb97743ee5eda08d8960679ed%7C66fecaf83dc04d2cb8b8eff0ddea46f0%7C1%7C1%7C637424301864169250%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=%2FnB4ivZeDNrVZiaeupFnAj86oQLOhMu1%2FK6YiuBxTB8%3D&reserved=0> in response to a similar question. If you want the simplest general explanation for FairShare values, it's that they range from 0.0 to 1.0, values above 0.5 indicate that account or user has used less than their share of the resource, and values below 0.5 indicate that that account or user has used more than their share of the resource.
Since all your users have the same RawShares value and are entitled to the same share of the resource, you can see that bdehaven has the most RawUsage and the lowest FairShare value, followed by ajoel and xtsao with almost identical RawUsage and FairShare, and finally ahantau with very little usage and the highest FairShare value.
We use FairShare here as the dominant factor in priorities for queued jobs: if you're a light user, we bump up your priority over heavier users, and your job starts quicker than those for heavier users, assuming all other job attributes are equal.
All these values are relative: in our setup, we'd bump ahantau's pending jobs ahead of the others, and put bdehaven's at the end. But if root needed to run a job outside the sray account, they'd get an enormous bump ahead since the sray account has used far more than its fair share of the resource.
From: slurm-users <slurm-users-bounces at lists.schedmd.com>
Date: Wednesday, December 2, 2020 at 11:23 AM
To: slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
Subject: Re: [slurm-users] FairShare
External Email Warning
This email originated from outside the university. Please use caution when opening attachments, clicking links, or responding to requests.
________________________________
I've read the manual and I re-read the other link. What they boil down to is Fair Share is calculated based on a recondite "rooted plane tree", which I do not have the background in discrete math to understand.
I'm hoping someone can explain it so my little kernel can understand.
________________________________
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Micheal Krombopulous <MichealKrombopulous at outlook.com>
Sent: Wednesday, December 2, 2020 9:32 AM
To: slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
Subject: [slurm-users] FairShare
Can someone tell me how to calculate fairshare (under fairtree)? I can't figure it out. I would have thought it would be the same score for all users in an account. E.g., here is one of my accounts:
Account User RawShares NormShares RawUsage NormUsage EffectvUsage LevelFS FairShare
-------------------- ---------- ---------- ----------- ----------- ----------- ------------- ---------- ----------
root 0.000000 611349 1.000000
root root 1 0.076923 0 0.000000 0.000000 inf 1.000000
sray 1 0.076923 30921 0.505582 0.505582 0.152147
sray phedge 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray raab 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray benequist 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray bosch 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray rjenkins 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray esmith 1 0.050000 0 0.000000 0.000000 1.7226e+07 0.054545
sray gheinz 1 0.050000 0 0.000000 0.000000 1.9074e+14 0.072727
sray jfitz 1 0.050000 0 0.000000 0.000000 8.0640e+20 0.081818
sray ajoel 1 0.050000 42449 0.069465 0.137396 0.363913 0.018182
sray jmay 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray aferrier 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray bdehaven 1 0.050000 225002 0.367771 0.727420 0.068736 0.009091
sray msmythe 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray gfink 1 0.050000 0 0.000000 0.000000 2.0343e+05 0.045455
sray ahantau 1 0.050000 31 0.000051 0.000102 491.737549 0.036364
sray hmiller 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray ttinker 1 0.050000 0 0.000000 0.000000 1.4798e+13 0.063636
sray wcooper 1 0.050000 0 0.000000 0.000000 inf 0.181818
sray xtsao 1 0.050000 41734 0.068296 0.135083 0.370143 0.027273
sray xping 1 0.050000 0 0.000000 0.000000 1.9833e+24 0.090909
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20201202/014a7e92/attachment-0001.htm>
More information about the slurm-users
mailing list