[slurm-users] Job not running with Resource Reason even though resources appear to be available

Christopher Samuel chris at csamuel.org
Tue Jan 26 21:32:09 UTC 2021


On 1/24/21 8:39 am, Paul Raines wrote:

> I think you have identified the issue here or are very close.  My 
> gres.conf on
> the rtx-04 node for example is:
> 
> AutoDetect=nvml
> Name=gpu Type=quadro_rtx_8000 File=/dev/nvidia0 Cores=0-15
[...]

Ah - you are doing both autodiscovery here and also specifying your 
config manually - it might be worth disabling the auto discovery and 
seeing if that helps.

All the best,
Chris
-- 
   Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA



More information about the slurm-users mailing list