Manisha Yadav via slurm-users slurm-users@lists.schedmd.com writes:
To achieve this, I attempted to use QoS by creating a floating partition with some of the nodes and configuring a QoS with priority. I also set a limit with GrpTRES=gres/gpu=24, given that each node has 8 GPUs, and there are 3 nodes in total.
If there are more nodes with GPUs, this will not prevent these users from getting GPUs on more than 3 nodes, it will only prevent them from getting more than 24 GPUs. It will not prevent them from running cpu-only jobs on other nodes either.. I think using GrpTRES=gres/gpu=24,node=3 (or perhaps simply GrpTRES=node=3) should work.