[slurm-users] Slurm configuration, Weight Parameter

Jim Prewett download at carc.unm.edu
Thu Nov 21 15:44:28 UTC 2019


Hi Sistemas,

I could be mistaken, but I don't think there is a way to require jobs on 
the 3GB nodes to request more than 2GB!

https://slurm.schedmd.com/slurm.conf.html states this: "Note that if a job 
allocation request can not be satisfied using the nodes with the lowest 
weight, the set of nodes with the next lowest weight is added to the set 
of nodes under consideration for use (repeat as needed for higher weight 
values)."

I read that to mean "if there are only 3GB nodes available, jobs will be 
run there reguardless of the memory needed."  We had a similar request but 
were unable to find a solution (and, ultimately the particular user is 
happier to not have idle machines when there's work to be done!).

If I'm misunderstanding, I'd love to know!

HTH,
Jim

On Thu, 21 Nov 2019, Sistemas NLHPC wrote:

> Hi all,
>
> Currently we have two types of nodes, one with 3GB and another with 2GB of
> RAM, it is required that in nodes of 3 GB it is not allowed to execute
> tasks with less than 2GB, to avoid underutilization of resources.
>
> This, because we have nodes that can fulfill the condition of executing
> tasks with 2GB or less.
>
> I try in the nodes configuration with the option "Weight".I send multiples
> jobs but slurm not asigned by "Weight", it's arbitrary in the order how
> send jobs. Some configuration and logs:
>
> slurm.conf
>
> NodeName=DEFAULT RealMemory=3007 Features=3007MB Weight=500 State=idle
> Sockets=2 CoresPerSocket=1
> NodeName=devcn050
>
> NodeName=DEFAULT RealMemory=3007 Features=3007MB Weight=100 State=idle
> Sockets=2 CoresPerSocket=1
> NodeName=devcn002
>
> NodeName=DEFAULT RealMemory=2000 Features=2000MB Weight=1 State=idle
> Sockets=2 CoresPerSocket=1
> NodeName=devcn001
>
> Extra information, I see that slurm assing Weight in the node.
>
> # sinfo -N -l
>
> NODELIST   NODES PARTITION       STATE CPUS    S:C:T MEMORY TMP_DISK WEIGHT
> AVAIL_FE REASON
> devcn001       1              slims*               idle       2
> 2:1:1   2000           0                1             2000MB    none
>
> devcn002       1              slims*               idle       2
> 2:1:1   3007           0             100            3007MB    none
>
> devcn050       1              slims*               idle       2
> 2:1:1   3007           0             500            3007MB    none
>
> I test other settings, such as the TRESWeigths parameter with no results,
> for example:
>
> NodeName=devcn001 TRESWeights="CPU=2.0,Mem=2000MB"
>
> Too PriorityType=priority/multifactor plugin is also activated and
> deactivated to test, but in all these cases it does not work.
>
> Thanks in advance.
>
> Regards.
>

James E. Prewett                    Jim at Prewett.org download at hpc.unm.edu
Systems Team Leader           LoGS: http://www.hpc.unm.edu/~download/LoGS/
Designated Security Officer         OpenPGP key: pub 1024D/31816D93
HPC Systems Engineer III   UNM HPC  505.277.8210



More information about the slurm-users mailing list