IIRC the thing you can't find is simply the order in which the nodes are listed in the partition definition (or alphanumerical order if they are added en masse with square brackets)
Dearl Slurm User List,
I am currently reviewing a slurm.conf where the developer set Weight
manually to attribute a greater weight to machines that have more RAM to
force smaller jobs on smaller instances. However, I feel like there is
something already in place or better than manually setting the weights,
but I couldn't find it.
If I understand correctly Slurm does not schedule jobs to the smallest
possible node on default. So small jobs can be scheduled to large
instances and a big job might have to wait indefinitely when using
backfilling.
I thought that Slurm does have mechanisms to prevent this but was unable
to find it again in the documentation.
Is there really no automatism at place or am I overlooking something?
Best,
Xaver
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-leave@lists.schedmd.com