[slurm-users] GRES and GPUs
hermann.schwaerzler at uibk.ac.at
Mon Jul 17 12:11:08 UTC 2023
what kind of SelectType are you using in your slurm.conf?
Per https://slurm.schedmd.com/gres.html you have to consider:
"As for the --gpu* option, these options are only supported by Slurm's
So you can use "--gpus ..." only when you state
SelectType = select/cons_tres
in your slurm.conf.
But "--gres=gpu:1" should work always.
On 7/17/23 13:43, Xaver Stiensmeier wrote:
> I am currently trying to understand how I can schedule a job that needs
> a GPU.
> I read about GRES https://slurm.schedmd.com/gres.html and tried to use:
> NodeName=test Gres=gpu:1
> But calling - after a 'sudo scontrol reconfigure':
> srun --gpus 1 hostname
> didn't work:
> srun: error: Unable to allocate resources: Invalid generic resource (gres) specification
> so I read more https://slurm.schedmd.com/gres.conf.html but that didn't
> really help me.
> I am rather confused. GRES claims to be generic resources but then it
> comes with three defined resources (GPU, MPS, MIG) and using one of
> those didn't work in my case.
> Obviously, I am misunderstanding something, but I am unsure where to look.
> Best regards,
> Xaver Stiensmeier
More information about the slurm-users