[slurm-users] Half of cpus used despite CR_CPU
Adrian Sevcenco
Adrian.Sevcenco at spacescience.ro
Wed Apr 13 07:59:03 UTC 2022
Hi! I have a weird situation with a cluster that I switched from CR_Core to CR_CPU.
The setup is select/cons_res, TaskPlugin=task/affinity,task/cgroup, TaskPluginParam=autobind=threads.
Although each job reports that only 1 CPU is needed:
NumNodes=1 NumCPUs=1 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
TRES=cpu=1,node=1,billing=1
Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*
JOB_GRES=(null)
Nodes=issaf-0-0 CPU_IDs=106-107 Mem=0 GRES=
MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0
Features=(null) DelayBoot=00:00:00
OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
only half of the job slots are used.
Meanwhile, sinfo reports that all CPUs are allocated:
root@issaf: ~ # sinfo -o "%10R %.16N %.6a %.14F %.14C %.12L %.12l"
PARTITION  NODELIST       AVAIL  NODES(A/I/O/T)  CPUS(A/I/O/T)  DEFAULTTIME  TIMELIMIT
CLUSTER    issaf-0-[0-2]  up     3/0/0/3         384/0/0/384    2-00:00:00   20-00:00:00
but only 192 jobs are running:
root@issaf: ~ # squeue -h -t R | wc -l
192
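As a quick sanity check on the numbers above, the following is a minimal sketch in plain bash (it does not call Slurm). The helper name count_cpu_ids is hypothetical. It counts how many CPU IDs the job's CPU_IDs=106-107 entry covers and compares that with the ratio of allocated CPUs to running jobs. Both come out as 2, which would be consistent with each job holding two CPU IDs even though it asks for one CPU. That is only an observation from the output shown here, not a confirmed diagnosis.

```shell
#!/usr/bin/env bash
# Hypothetical helper: count how many CPU IDs an ID list such as
# "106-107" or "0-3,8" covers. Pure string arithmetic, no Slurm calls.
count_cpu_ids() {
    local list="$1" total=0 part lo hi
    local IFS=','                      # split the list on commas
    for part in $list; do
        lo="${part%-*}"                # text before the dash (or the whole item)
        hi="${part#*-}"                # text after the dash (or the whole item)
        total=$(( total + hi - lo + 1 ))
    done
    echo "$total"
}

count_cpu_ids "106-107"    # CPU_IDs from the job record above; prints 2
echo $(( 384 / 192 ))      # allocated CPUs per running job; prints 2
```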
Does anyone have any idea, or experience with, why the 384 CPUs are not being used as 384 job slots?
Thanks a lot!
Adrian