[slurm-users] Power Save: When is RESUME an invalid node state?

Xaver Stiensmeier xaverstiensmeier at gmx.de
Wed Dec 6 08:28:12 UTC 2023


Dear Slurm User list,

using https://slurm.schedmd.com/power_save.html we had one case out of
many (>242) node starts that resulted in

|slurm_update error: Invalid node state specified|

when we called:

|scontrol update NodeName="$1" state=RESUME reason=FailedStartup|

in the Fail script. We run this to make 100% sure that the instances -
that are created on demand - are again `~idle` after being removed by
the fail program. They are set to RESUME before the actual instance gets
destroyed. I remember that I had this case manually before, but I don't
remember when it occurs.

Maybe someone has a great idea how to tackle this problem.

Best regards
Xaver Stiensmeier
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20231206/c81f2e93/attachment.htm>


More information about the slurm-users mailing list