[slurm-users] GPU / cgroup challenges
Christopher Samuel
chris at csamuel.org
Tue May 1 17:54:40 MDT 2018
On 02/05/18 09:31, R. Paul Wiegand wrote:
> Slurm 17.11.0 on CentOS 7.1
That's quite old (on both fronts, RHEL 7.1 is from 2015), we started on
that same Slurm release but didn't do the GPU cgroup stuff until a later
version (17.11.3 on RHEL 7.4).
I don't see anything in the NEWS file about relevant cgroup changes
though (there is a cgroup affinity fix but that's unrelated).
You do have identical slurm.conf, cgroup.conf,
cgroup_allowed_devices_file.conf etc on all the compute nodes too?
Slurmd and slurmctld have both been restarted since they were
configured?
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC
More information about the slurm-users
mailing list