[slurm-users] status of cloud nodes

nathan norton nathan at nanoservices.com.au
Thu Jun 20 09:02:02 UTC 2019


On 20/6/19 3:24 am, Brian Andrus wrote:
>
> Can you give the exact command/output you have from this?
>
> I suspect a typo in your slurm.conf for nodenames or what you are typing.
>
> Brian Andrus
>
>
Hi Brian,

I am pretty sure there is no error in my typing of the commands, but 
just in case find below the command. I ran scontrol show node with a 
working and not working node.


Any other ideas?


Nathan


[centos at bt_slurm_master ~]$cat /etc/slurm/slurm.conf

.....

NodeName=ip-10-0-8-[2-100] CPUs=16 RealMemory=27648 Sockets=1 
CoresPerSocket=16 ThreadsPerCore=1  State=CLOUD

NodeName=bt_slurm_login00[1-10] RealMemory=512 
State=DOWN#oPartitionName=backtest Nodes=ip-10-0-8-[2-100] Default=YES 
MaxTime=300 Oversubscribe=NO State=UP

PartitionName=backtest Nodes=ip-10-0-8-[2-100] Default=YES MaxTime=300 
Oversubscribe=NO State=UP Priority=1 PreemptMode=requeue

  ....


[centos at bt_slurm_master ~]$ sinfo

PARTITION   AVAIL  TIMELIMIT  NODES  STATE NODELIST

backtest*      up    5:00:00      2  down* ip-10-0-8-[29-30]

backtest*      up    5:00:00     52    mix 
ip-10-0-8-[4-17,19-24,26-28,31-59]

[centos at bt_slurm_master ~]$ scontrol  show node ip-10-0-8-3

Node ip-10-0-8-3 not found

[centos at bt_slurm_master ~]$ scontrol  show node ip-10-0-8-4

NodeName=ip-10-0-8-4 Arch=x86_64 CoresPerSocket=16

    CPUAlloc=5 CPUTot=16 CPULoad=21.10

    AvailableFeatures=(null)

    ActiveFeatures=(null)

    Gres=(null)

    NodeAddr=10.0.8.4 NodeHostName=ip-10-0-8-4 Port=0 Version=18.08

    OS=Linux 3.10.0-957.1.3.el7.x86_64 #1 SMP Thu Nov 29 14:49:43 UTC 2018

    RealMemory=27648 AllocMem=0 FreeMem=13581 Sockets=1 Boards=1

    State=MIXED+CLOUD ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A 
MCS_label=N/A

    Partitions=backtest

    BootTime=2019-06-18T03:09:03 SlurmdStartTime=2019-06-18T03:13:45

    CfgTRES=cpu=16,mem=27G,billing=16

    AllocTRES=cpu=5

    CapWatts=n/a

    CurrentWatts=0 LowestJoules=0 ConsumedJoules=0

    ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20190620/ab83b328/attachment.html>


More information about the slurm-users mailing list