[slurm-users] status of cloud nodes
toomuchit at gmail.com
Wed Jun 19 17:24:43 UTC 2019
Can you give the exact command/output you have from this?
I suspect a typo in your slurm.conf for nodenames or what you are typing.
On 6/18/2019 11:29 PM, nathan norton wrote:
> It just shows
> "Node $NODE not found"
> Whereas others all work as expected (ie, they are running)
> Without knowing the internals of slurm it feels like nodes that are
> turned off+cloud state don't exist in the system until they are on?
> Any other ideas?
> On Wed., 19 Jun. 2019, 4:21 pm Chris Samuel, <chris at csamuel.org
> <mailto:chris at csamuel.org>> wrote:
> On Tuesday, 18 June 2019 9:36:56 PM PDT nathan norton wrote:
> > Just tried running that command, but it only shows nodes that
> are up and
> > running, doesn’t tell me about any nodes that are down and
> turned off, as
> > an example please see below. There is a job running that should
> be using
> > the 100 nodes but only 52 are allocated (plus 2 down* (that I
> know about
> > and don’t care about in this case)) where are the stats and
> details on why
> > the 40ish other nodes are not being used? (nothing in the
> masters log file
> > either)
> I suspect this is related to their cloud state.
> What does "scontrol show node $NODE" say where $NODE is the name
> of a node
> that isn't being listed despite you expecting it to be?
> All the best,
> Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the slurm-users