14 Apr
2026
14 Apr
'26
1:22 a.m.
On 4/13/26 4:02 am, Massimo Sgaravatto via slurm-users wrote:
CfgTRES=cpu=384,mem=1500G,billing=839,gres/gpu=4,gres/gpu:nvidia-h100=4 AllocTRES=cpu=8,mem=560000M,gres/gpu=4,gres/gpu:nvidia-h100=4
For some reason whatever jobs are running on that node are consuming all 4 GPUs - now the job you mention isn't asking for them:
ReqTRES=cpu=1,mem=100G,node=1,billing=26 AllocTRES=cpu=1,mem=100G,node=1,billing=26
So is it possible there's another job on there too? What does "squeue -w cld-ter-gpu-01" say? Also what does "scontrol show part onlycpus-opp" say? All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Philadelphia, PA, USA