[slurm-users] [External] Re: serious bug about CUDA_VISBLE_DEVICES in the slurm 17.11.7

Chaofeng Zhang zhangcf1 at lenovo.com
Thu Aug 30 03:18:20 MDT 2018


CUDA_VISBLE_DEVICES is used by many AI framework to determine which gpu to use, like tensorflow. So this environment is critical to us.

-----Original Message-----
From: slurm-users <slurm-users-bounces at lists.schedmd.com> On Behalf Of Chris Samuel
Sent: Thursday, August 30, 2018 4:42 PM
To: slurm-users at lists.schedmd.com
Subject: [External] Re: [slurm-users] serious bug about CUDA_VISBLE_DEVICES in the slurm 17.11.7

On Thursday, 30 August 2018 6:38:08 PM AEST Chaofeng Zhang wrote:

> The CUDA_VISBLE_DEVICES can't be set NoDevFiles in Slurm 17.11.7.  
> This is worked when we use Slurm 17.02.

You probably should be using cgroups instead to constrain access to GPUs.  
Then it doesn't matter what you set CUDA_VISBLE_DEVICES to be as processes will only be able to access what they requested.

Hope that helps!
Chris
--
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC







More information about the slurm-users mailing list