[slurm-users] GRES and GPUs

Hermann Schwärzler hermann.schwaerzler at uibk.ac.at
Mon Jul 17 12:11:08 UTC 2023


Hi Xaver,

what kind of SelectType are you using in your slurm.conf?

Per https://slurm.schedmd.com/gres.html you have to consider:
"As for the --gpu* option, these options are only supported by Slurm's 
select/cons_tres plugin."

So you can use "--gpus ..." only when you state
SelectType              = select/cons_tres
in your slurm.conf.

But "--gres=gpu:1" should work always.

Regards
Hermann


On 7/17/23 13:43, Xaver Stiensmeier wrote:
> Hey,
> 
> I am currently trying to understand how I can schedule a job that needs 
> a GPU.
> 
> I read about GRES https://slurm.schedmd.com/gres.html and tried to use:
> 
> GresTypes=gpu
> NodeName=test Gres=gpu:1
> 
> But calling - after a 'sudo scontrol reconfigure':
> 
> srun --gpus 1 hostname
> 
> didn't work:
> 
> srun: error: Unable to allocate resources: Invalid generic resource (gres) specification
> 
> so I read more https://slurm.schedmd.com/gres.conf.html but that didn't 
> really help me.
> 
> 
> I am rather confused. GRES claims to be generic resources but then it 
> comes with three defined resources (GPU, MPS, MIG) and using one of 
> those didn't work in my case.
> 
> Obviously, I am misunderstanding something, but I am unsure where to look.
> 
> 
> Best regards,
> Xaver Stiensmeier
> 



More information about the slurm-users mailing list