[slurm-users] GPU / cgroup challenges

Christopher Samuel chris at csamuel.org
Tue May 1 17:54:40 MDT 2018


On 02/05/18 09:31, R. Paul Wiegand wrote:

> Slurm 17.11.0 on CentOS 7.1

That's quite old (on both fronts, RHEL 7.1 is from 2015), we started on
that same Slurm release but didn't do the GPU cgroup stuff until a later
version (17.11.3 on RHEL 7.4).

I don't see anything in the NEWS file about relevant cgroup changes
though (there is a cgroup affinity fix but that's unrelated).

You do have identical slurm.conf, cgroup.conf,
cgroup_allowed_devices_file.conf etc on all the compute nodes too?
Slurmd and slurmctld have both been restarted since they were
configured?

All the best,
Chris
-- 
  Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC



More information about the slurm-users mailing list