[slurm-users] 19.05 and GPUs vs GRES

Christopher Benjamin Coffey Chris.Coffey at nau.edu
Tue Aug 13 20:36:35 UTC 2019


Thanks for that Chris! :)

Sounds like other than the new requests for gpu specifics, things should just work when upgrading to 19.05 as slurm is likely backwards compatible with the previous setup gres stuff.

Best,
Chris

—
Christopher Coffey
High-Performance Computing
Northern Arizona University
928-523-1167
 

On 8/12/19, 10:28 PM, "slurm-users on behalf of Chris Samuel" <slurm-users-bounces at lists.schedmd.com on behalf of chris at csamuel.org> wrote:

    On Monday, 12 August 2019 11:42:48 AM PDT Christopher Benjamin Coffey wrote:
    
    > Excuse me if this has been explained somewhere, I did some searching. With
    > 19.05, is there any reason to have gres.conf on the GPU nodes? Is slurm
    > smart enough to enumerate the /dev/nvidia* devices? We are moving to 19.05
    > shortly, any gotchas with GRES and GPUs? Also, I'm guessing now, there is
    > no reason for users to request "--gres:gpu" type stuff anymore and instead
    > use: --gpus=n ?
    
    We do have 19.05 on our GPU nodes, but I've not had time to experiment with 
    the new request syntax just yet.
    
    Regarding configuration it does appear to be that you still need to set them 
    up, but if you link Slurm against the nvidia NVML library at compile time then 
    there is support for autodetection.
    
    https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fslurm.schedmd.com%2Fgres.html&data=02%7C01%7Cchris.coffey%40nau.edu%7Cfc6ede93f45440fdaf1508d71faf0362%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C637012708851283210&sdata=lUrvaHgA4jSVgvlcSd9GJBBOZ8dSWSHSNl9ee%2Bv4Xo0%3D&reserved=0
    
    # In the case of GPUs, if AutoDetect=nvml in gres.conf and the NVML library
    # is installed on the node and was present during Slurm configuration, the
    # missing configuration details will be automatically gathered using the
    # NVML library. Configuration information about all other generic resource
    # must explicitly be described in the gres.conf file. 
    
    All the best,
    Chris
    -- 
      Chris Samuel  :  https://nam05.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.csamuel.org%2F&data=02%7C01%7Cchris.coffey%40nau.edu%7Cfc6ede93f45440fdaf1508d71faf0362%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C637012708851283210&sdata=hnqqFo7C%2FVg60ZmgPZOcianQTcFlcRS5d%2Fl5O4OQCSw%3D&reserved=0  :  Berkeley, CA, USA
    
    
    
    
    



More information about the slurm-users mailing list