[slurm-users] can't get fairshare to be calculated per partition

Igor Feghali igor.feghali at gmail.com
Tue Oct 29 19:42:19 UTC 2019


hi there

i'm pretty new to slurm and trying to learn my way through it's many
configurations. my config looks like:

SchedulerType=sched/backfill
PriorityType=priority/multifactor
PriorityWeightAge=10000
PriorityWeightFairshare=10000
PriorityWeightJobSize=5000
PriorityWeightPartition=10000
PriorityWeightQOS=10000

and partitions:

# Node Configurations
NodeName=nodeb[01-10]n[01-05] NodeAddr=nodeb[01-10]n[01-05] CPUs=8
RealMemory=95000 TmpDisk=1700000
PartitionName=part1 Nodes=nodeb[01-10]n[01-05] LLN=YES Default=YES
MaxTime=INFINITE State=Up

NodeName=nodeb[01-10]n[06-46] NodeAddr=nodeb[01-10]n[06-46] CPUs=8
RealMemory=95000 TmpDisk=1700000
PartitionName=part1b Nodes=nodeb[01-10]n[06-46] MaxTime=INFINITE State=Up

NodeName=nodeb[01-10]n[47-48] NodeAddr=nodeb[01-10]n[47-48] CPUs=36
RealMemory=190000 TmpDisk=3500000
PartitionName=part2 Nodes=nodeb[01-10]n[47-48] MaxTime=INFINITE State=Up

NodeName=nodeb12n[01-04] NodeAddr=nodeb12n[01-04] CPUs=8 RealMemory=380000
TmpDisk=5600000
PartitionName=part3 Nodes=nodeb12n[01-04] LLN=YES MaxTime=INFINITE State=Up
PartitionName=part3b Nodes=nodeb12n[01-04] LLN=YES MaxTime=INFINITE State=Up


i'm trying to distribute jobs in partitions evenly per account. that is,
for a given partition if I have jobs queued for accounts A and B I expect
the partition to be split approximately 50/50 between those two accounts
(no matter how the other partitions look like).
but things are not working as I expect and as I can tell fairshare is been
calculated for the entire cluster and not per partition.

sample output of `sshare -lm":

             Account       User    Partition  RawShares  NormShares
 RawUsage   NormUsage  EffectvUsage  FairShare
 GrpTRESMins                    TRESRunMin
-------------------- ---------- ------------ ---------- -----------
----------- ----------- ------------- ----------
------------------------------ -----------------------------
root                                                       1.000000
 1059109661                  1.000000   0.500000
     cpu=0,mem=0,energy=0,node=0,b
 root                      root                       1    0.043478
  0    0.000000      0.000000   1.000000
 cpu=0,mem=0,energy=0,node=0,b
 2dpama38b                                            1    0.043478
  0    0.000000      0.000000   1.000000
 cpu=0,mem=0,energy=0,node=0,b
 2pama38                                              1    0.043478
31906    0.000030      0.000030   0.999520
 cpu=0,mem=0,energy=0,node=0,b
etc.

as you can see the partition column is not been populated so I guess that's
my problem. i am having a hard time to fix it so any tip would be
appreciated.

thank you.

side note: part3 and part3b are identical because I need jobs to be split
evenly between those two partitions. if one of them has no jobs than the
other partition gets all the machines. I have no clue how to do that but
that's a subject for a future email.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20191029/e41a77c8/attachment.htm>


More information about the slurm-users mailing list