[slurm-users] Jobs are pending when plenty of resources are available

Marcus Wagner wagner at itc.rwth-aachen.de
Tue Mar 31 05:51:32 UTC 2020


Hi Mike,

but that would mean that 409978 requests nearly the whole cluster. I'm 
wondering what resources it is waiting for.
Yet there are nearly 32000 nodes idle. I would assume such a one-node 
job would fit. But you are right, it depends on the higher-priority job.
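
To make Mike's point about backfill concrete, here is a toy Python sketch
(not Slurm's actual algorithm) of how a single high-priority job that does
not fit can block a small job behind it. The size of 409978's request, both
priority values, and treating the idle figure as cores are assumptions made
up for illustration; only the 8 cores for 409999 come from this thread.

```python
from dataclasses import dataclass


@dataclass
class Job:
    job_id: int
    priority: int  # higher value is considered first (hypothetical values)
    cores: int     # requested cores


def start_in_priority_order(pending, idle_cores, backfill):
    """Return the IDs of the jobs that could start right now.

    Toy model: jobs are considered from highest to lowest priority.
    Without backfill, the first job that does not fit blocks every job
    behind it. With backfill, smaller jobs further down may still start.
    Time limits are ignored here, which a real backfill scheduler
    would not do.
    """
    started = []
    for job in sorted(pending, key=lambda j: j.priority, reverse=True):
        if job.cores <= idle_cores:
            idle_cores -= job.cores
            started.append(job.job_id)
        elif not backfill:
            break  # strict priority order: nothing may overtake this job
    return started


pending = [
    Job(job_id=409978, priority=1000, cores=40000),  # assumed request size
    Job(job_id=409999, priority=10, cores=8),        # 8 cores, per Mike
]
idle = 32000  # assumption: the idle figure above, counted as cores

print(start_in_priority_order(pending, idle, backfill=False))  # []
print(start_in_priority_order(pending, idle, backfill=True))   # [409999]
```

Note that a real backfill scheduler only starts a lower-priority job if that
does not delay the expected start of the higher-priority ones, and it judges
that from the time limits. A job with an unlimited time limit, like 409999,
might therefore stay pending even with backfill enabled.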

Best
Marcus


On 3/30/20 3:47 PM, Renfro, Michael wrote:
> All of this is subject to scheduler configuration, but: what has job 409978 requested, in terms of resources and time? It looks like it's the highest priority pending job in the interactive partition, and I’d expect the interactive partition has a higher priority than the regress partition.
>
> As for job 409999, it’s requesting 8 cores and 32 GB of RAM for an infinite amount of time, not 1 core and 1 GB of RAM.
>
> *If* job 409978 has requested a large amount of time on the entire cluster, *and* you don’t have backfill running, I could see this situation happening.
>

-- 
Marcus Wagner, Dipl.-Inf.

IT Center
Abteilung: Systeme und Betrieb
RWTH Aachen University
Seffenter Weg 23
52074 Aachen
Tel: +49 241 80-24383
Fax: +49 241 80-624383
wagner at itc.rwth-aachen.de
www.itc.rwth-aachen.de



