[slurm-users] [External] Cancel "reboot ASAP" for a node

Prentice Bisbal pbisbal at pppl.gov
Fri Aug 7 16:41:47 UTC 2020


Are you looking at the man page on the SchedMD website, or on your 
computer? If you're looking at the website, those pages are for the 
latest version and may not match what you have installed, so this could 
be a feature in a later version tha 18.08.

--
Prentice

On 8/7/20 11:43 AM, Hanby, Mike wrote:
>
> Howdy, (Slurm 18.08)
>
> We have a bunch of node that we've updated to "scontrol reboot ASAP".
>
> We'd like to cancel a few of those. From the man page, it's suggested 
> that either of the following should work, however both report the same 
> error " slurm_update error: Invalid node state specified":
>
> scontrol cancel_reboot c01
>
> or
>
> scontrol Update NodeName=c01 State=CANCEL_REBOOT
>
> Here's the 'scontrol show node c01' info for reference:
>
> NodeName=c01 Arch=x86_64 CoresPerSocket=12
>
> CPUAlloc=7 CPUTot=24 CPULoad=7.04
>
> AvailableFeatures=(null)
>
> ActiveFeatures=(null)
>
> Gres=(null)
>
> NodeAddr=c0115 NodeHostName=c01 Version=18.08
>
>    OS=Linux 3.10.0-1062.9.1.el7.x86_64 #1 SMP Mon Dec 2 08:31:54 EST 2019
>
> RealMemory=191877 AllocMem=6536 FreeMem=176717 Sockets=2 Boards=1
>
> State=MIXED+DRAIN ThreadsPerCore=1 TmpDisk=887366 Weight=1 Owner=N/A 
> MCS_label=N/A
>
> Partitions=interactive,short,long,medium,express
>
> BootTime=2020-07-08T23:16:27 SlurmdStartTime=2020-07-08T23:32:05
>
> CfgTRES=cpu=24,mem=191877M,billing=24
>
> AllocTRES=cpu=7,mem=6536M
>
> CapWatts=n/a
>
> CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
>
> ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
>
> Reason=Reboot ASAP [root at 2020-08-06T10:29:22]
>
> Any thoughts as to how to cancel the reboot?
>
> ----------------
>
> Mike Hanby
>
> mhanby @ uab.edu
>
> Systems Analyst III - Enterprise
>
> IT Research Computing Services
>
> The University of Alabama at Birmingham
>
-- 
Prentice Bisbal
Lead Software Engineer
Research Computing
Princeton Plasma Physics Laboratory
http://www.pppl.gov

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200807/4a4f9c80/attachment-0001.htm>


More information about the slurm-users mailing list