[slurm-users] "Low socket*core*thre" - solution?
Mahmood Naderan
mahmood.nt at gmail.com
Sun May 6 03:28:55 MDT 2018
Although this thread belongs to someone else, but a solution may apply
for others too.
[root at rocks7 ~]# lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Thread(s) per core: 2
Core(s) per socket: 8
Socket(s): 2
NUMA node(s): 4
Vendor ID: AuthenticAMD
CPU family: 21
Model: 2
Model name: AMD Opteron(tm) Processor 6380
Stepping: 0
CPU MHz: 1400.000
CPU max MHz: 2500.0000
CPU min MHz: 1400.0000
BogoMIPS: 4999.86
Virtualization: AMD-V
L1d cache: 16K
L1i cache: 64K
L2 cache: 2048K
L3 cache: 6144K
NUMA node0 CPU(s): 0-7
NUMA node1 CPU(s): 8-15
NUMA node2 CPU(s): 16-23
NUMA node3 CPU(s): 24-31
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep
mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx
mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc art rep_good nopl
nonstop_tsc extd_apicid amd_dcm aperfmperf pni pclmulqdq monitor ssse3
fma cx16 sse4_1 sse4_2 popcnt aes xsave avx f16c lahf_lm cmp_legacy
svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs
xop skinit wdt lwp fma4 tce nodeid_msr tbm topoext perfctr_core
perfctr_nb cpb hw_pstate bmi1 arat npt lbrv svm_lock nrip_save
tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold
[root at rocks7 ~]# slurmd -C
NodeName=rocks7 slurmd: Considering each NUMA node as a socket
CPUs=32 Boards=1 SocketsPerBoard=4 CoresPerSocket=8 ThreadsPerCore=1
RealMemory=64261
UpTime=23-00:51:17
[root at rocks7 ~]# lstopo
bash: lstopo: command not found...
[root at rocks7 ~]# scontrol show node rocks7
NodeName=rocks7 Arch=x86_64 CoresPerSocket=1
CPUAlloc=0 CPUErr=0 CPUTot=1 CPULoad=0.03
AvailableFeatures=(null)
ActiveFeatures=(null)
Gres=(null)
NodeAddr=10.1.1.1 NodeHostName=rocks7 Version=17.11
OS=Linux 3.10.0-693.5.2.el7.x86_64 #1 SMP Fri Oct 20 20:32:50 UTC 2017
RealMemory=64261 AllocMem=0 FreeMem=11102 Sockets=1 Boards=1
State=IDLE+DRAIN ThreadsPerCore=1 TmpDisk=281775 Weight=1 Owner=N/A
MCS_label=N/A
Partitions=WHEEL,EMERALD
BootTime=2018-04-13T13:05:00 SlurmdStartTime=2018-04-13T13:05:17
CfgTRES=cpu=1,mem=64261M,billing=1
AllocTRES=
CapWatts=n/a
CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
Reason=Low socket*core*thread count, Low CPUs [root at 2018-05-05T21:49:45]
I also have noticed that State returned back to IDLE+DRAIN
!
I am guessing to set Sockets to 32!!
Regards,
Mahmood
On Sun, May 6, 2018 at 9:35 AM, Werner Saar <wernsaar at googlemail.com> wrote:
> Hi,
>
> what is the output of the command:
>
> slurmd -C rocks7
>
>
> Best regards
>
> Werner
>
>
More information about the slurm-users
mailing list