<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">On 20/6/19 3:24 am, Brian Andrus wrote:<br>
</div>
<blockquote type="cite"
cite="mid:a61efeb9-4ee5-3c97-b2c1-53637e8f6bde@gmail.com">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<p>Can you give the exact command/output you have from this?</p>
<p>I suspect a typo in your slurm.conf for nodenames or what you
are typing.</p>
<p>Brian Andrus</p>
<p><br>
</p>
</blockquote>
<p>Hi Brian,</p>
<p>I am pretty sure there is no error in my typing of the commands,
but just in case find below the command. I ran scontrol show node
with a working and not working node.<br>
</p>
<p><br>
</p>
<p>Any other ideas?</p>
<p><br>
</p>
<p>Nathan<br>
</p>
<br>
<p>[centos@bt_slurm_master ~]$cat /etc/slurm/slurm.conf<br>
</p>
<p>.....<br>
</p>
<p>NodeName=ip-10-0-8-[2-100] CPUs=16 RealMemory=27648 Sockets=1
CoresPerSocket=16 ThreadsPerCore=1 State=CLOUD</p>
NodeName=bt_slurm_login00[1-10] RealMemory=512
State=DOWN#oPartitionName=backtest Nodes=ip-10-0-8-[2-100]
Default=YES MaxTime=300 Oversubscribe=NO State=UP
<p class="MsoNormal">PartitionName=backtest
Nodes=ip-10-0-8-[2-100] Default=YES MaxTime=300 Oversubscribe=NO
State=UP Priority=1 PreemptMode=requeue
</p>
<p class="MsoNormal"> ....</p>
<p class="MsoNormal"><br>
</p>
<p class="MsoNormal">[centos@bt_slurm_master ~]$ sinfo</p>
<p class="MsoNormal">PARTITION AVAIL TIMELIMIT NODES STATE
NODELIST</p>
<p class="MsoNormal">backtest* up 5:00:00 2 down*
ip-10-0-8-[29-30]</p>
<p class="MsoNormal">backtest* up 5:00:00 52 mix
ip-10-0-8-[4-17,19-24,26-28,31-59]</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">[centos@bt_slurm_master ~]$ scontrol show node
ip-10-0-8-3</p>
<p class="MsoNormal">Node ip-10-0-8-3 not found</p>
<p class="MsoNormal">[centos@bt_slurm_master ~]$ scontrol show node
ip-10-0-8-4</p>
<p class="MsoNormal">NodeName=ip-10-0-8-4 Arch=x86_64
CoresPerSocket=16</p>
<p class="MsoNormal"> CPUAlloc=5 CPUTot=16 CPULoad=21.10</p>
<p class="MsoNormal"> AvailableFeatures=(null)</p>
<p class="MsoNormal"> ActiveFeatures=(null)</p>
<p class="MsoNormal"> Gres=(null)</p>
<p class="MsoNormal"> NodeAddr=10.0.8.4 NodeHostName=ip-10-0-8-4
Port=0 Version=18.08</p>
<p class="MsoNormal"> OS=Linux 3.10.0-957.1.3.el7.x86_64 #1 SMP
Thu Nov 29 14:49:43 UTC 2018</p>
<p class="MsoNormal"> RealMemory=27648 AllocMem=0 FreeMem=13581
Sockets=1 Boards=1</p>
<p class="MsoNormal"> State=MIXED+CLOUD ThreadsPerCore=1 TmpDisk=0
Weight=1 Owner=N/A MCS_label=N/A</p>
<p class="MsoNormal"> Partitions=backtest</p>
<p class="MsoNormal"> BootTime=2019-06-18T03:09:03
SlurmdStartTime=2019-06-18T03:13:45</p>
<p class="MsoNormal"> CfgTRES=cpu=16,mem=27G,billing=16</p>
<p class="MsoNormal"> AllocTRES=cpu=5</p>
<p class="MsoNormal"> CapWatts=n/a</p>
<p class="MsoNormal"> CurrentWatts=0 LowestJoules=0
ConsumedJoules=0</p>
<p class="MsoNormal"> ExtSensorsJoules=n/s ExtSensorsWatts=0
ExtSensorsTemp=n/s</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal"> </p>
</body>
</html>