[slurm-users] Unconfigured GPUs being allocated
chris at csamuel.org
Fri Jul 14 17:57:28 UTC 2023
On 7/14/23 10:20 am, Wilson, Steven M wrote:
> I upgraded Slurm to 23.02.3 but I'm still running into the same problem.
> Unconfigured GPUs (those absent from gres.conf and slurm.conf) are still
> being made available to jobs so we end up with compute jobs being run on
> GPUs which should only be used
I think this is expected - it's not that Slurm is making them available,
it's that it's unaware of them and so doesn't control them in the way it
does for the GPUs it does know about. So you get the default behaviour
(any process can access them).
If you want to stop them being accessed from Slurm you'd need to find a
way to prevent that access via cgroups games or similar.
All the best,
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
More information about the slurm-users