[slurm-users] 19.05 and GPUs vs GRES
Christopher Benjamin Coffey
Chris.Coffey at nau.edu
Tue Aug 13 20:36:35 UTC 2019
Thanks for that Chris! :)
Sounds like other than the new requests for gpu specifics, things should just work when upgrading to 19.05 as slurm is likely backwards compatible with the previous setup gres stuff.
Northern Arizona University
On 8/12/19, 10:28 PM, "slurm-users on behalf of Chris Samuel" <slurm-users-bounces at lists.schedmd.com on behalf of chris at csamuel.org> wrote:
On Monday, 12 August 2019 11:42:48 AM PDT Christopher Benjamin Coffey wrote:
> Excuse me if this has been explained somewhere, I did some searching. With
> 19.05, is there any reason to have gres.conf on the GPU nodes? Is slurm
> smart enough to enumerate the /dev/nvidia* devices? We are moving to 19.05
> shortly, any gotchas with GRES and GPUs? Also, I'm guessing now, there is
> no reason for users to request "--gres:gpu" type stuff anymore and instead
> use: --gpus=n ?
We do have 19.05 on our GPU nodes, but I've not had time to experiment with
the new request syntax just yet.
Regarding configuration it does appear to be that you still need to set them
up, but if you link Slurm against the nvidia NVML library at compile time then
there is support for autodetection.
# In the case of GPUs, if AutoDetect=nvml in gres.conf and the NVML library
# is installed on the node and was present during Slurm configuration, the
# missing configuration details will be automatically gathered using the
# NVML library. Configuration information about all other generic resource
# must explicitly be described in the gres.conf file.
All the best,
Chris Samuel : https://nam05.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.csamuel.org%2F&data=02%7C01%7Cchris.coffey%40nau.edu%7Cfc6ede93f45440fdaf1508d71faf0362%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C637012708851283210&sdata=hnqqFo7C%2FVg60ZmgPZOcianQTcFlcRS5d%2Fl5O4OQCSw%3D&reserved=0 : Berkeley, CA, USA
More information about the slurm-users