[slurm-users] Building Slurm RPMs with NVIDIA GPU support?

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Tue Jan 26 19:29:43 UTC 2021


In another thread, on 26-01-2021 17:44, Prentice Bisbal wrote:
> Personally, I think it's good that Slurm RPMs are now available through 
> EPEL, although I won't be able to use them, and I'm sure many people on 
> the list won't be able to either, since licensing issues prevent them 
> from providing support for NVIDIA drivers, so those of us with GPUs on 
> our clusters will still have to compile Slurm from source to include 
> NVIDIA GPU support.

We're running Slurm 20.02.6 and recently added some NVIDIA GPU nodes.
The relevant Slurm GPU documentation appears to be
https://slurm.schedmd.com/gres.html
We don't seem to have any problems scheduling jobs on GPUs, even though 
our Slurm RPM build host has no NVIDIA software installed. The 
following command prints nothing on that host:
$ ldconfig -p | grep libnvidia-ml
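
Since grep prints nothing when there is no match, a small wrapper can 
make the result explicit. This is only a sketch: check_lib is a 
hypothetical helper name of my own, not part of Slurm or ldconfig, and 
it only inspects the linker cache listing it is given on stdin.

```shell
# Hypothetical helper: report whether a library name appears in a
# linker-cache listing read from stdin (e.g. the output of ldconfig -p).
check_lib() {
    if grep -q -- "$1"; then
        echo "$1: found"
    else
        echo "$1: not found"
    fi
}

# On the RPM build host:
ldconfig -p 2>/dev/null | check_lib libnvidia-ml
```

On a host without the NVIDIA management library in the linker cache, 
this should report "libnvidia-ml: not found".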

I'm curious about Prentice's statement about needing NVIDIA libraries to 
be installed when building Slurm RPMs. I read the discussion in bug 
9525,
https://bugs.schedmd.com/show_bug.cgi?id=9525
which suggests that the problem was fixed in 20.02.6 and 20.11.

Question: Is there anything special that needs to be done when building 
Slurm RPMs with NVIDIA GPU support?

Thanks,
Ole


