[slurm-users] can't use GRES to increase Raw Usage
Erik Bryer
ebryer at isi.edu
Thu Jan 7 22:48:03 UTC 2021
I have 2 partitions:
PartitionName=scavenge Nodes=saga-test01,saga-test02 MaxTime=72:00:00 State=UP PriorityTier=0 PreemptMode=REQUEUE AllowQos=scavenge AllowAccounts=borrowed,gaia default=yes TRESBillingWeights="CPU=1.0,Mem=0.25G,GRES/foolsgold=200000"
PartitionName=scavtres Nodes=saga-test01,saga-test02 MaxTime=72:00:00 State=UP PriorityTier=0 PreemptMode=REQUEUE AllowQos=scavenge AllowAccounts=borrowed,gaia default=yes TRESBillingWeights="CPU=1.0,Mem=0.25G,GRES/foolsgold=200.0"
I run 2 jobs in each partition, each job using 2 gpus. The job in the first partition should bill at a higher rate, but it doesn't. After about 1 hour of running the jobs, I see little increase in RawUsage shown by sshare -a.
$ sshare -a | egrep "(Raw|\-\-\-|borrowed.*sagatest01)"
Account User RawShares NormShares RawUsage EffectvUsage FairShare
-------------------- ---------- ---------- ----------- ----------- ------------- ----------
borrowed sagatest01 1582 0.333333 25911 0.000103 0.166667
RawUsage has increased by 540 after 30 minutes. If I change in the partition definition CPU=1.0 to CPU=200000, the RawUsage numbers leap up. How can I get TRESBillingWeights to acknowledge GRES?
Thanks,
Erik
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20210107/8cd8f666/attachment.htm>
More information about the slurm-users
mailing list