[slurm-users] Power Save: When is RESUME an invalid node state?
Xaver Stiensmeier
xaverstiensmeier at gmx.de
Wed Dec 6 08:28:12 UTC 2023
Dear Slurm User list,
using https://slurm.schedmd.com/power_save.html we had one case out of
many (>242) node starts that resulted in
|slurm_update error: Invalid node state specified|
when we called:
|scontrol update NodeName="$1" state=RESUME reason=FailedStartup|
in the Fail script. We run this to make 100% sure that the instances -
that are created on demand - are again `~idle` after being removed by
the fail program. They are set to RESUME before the actual instance gets
destroyed. I remember that I had this case manually before, but I don't
remember when it occurs.
Maybe someone has a great idea how to tackle this problem.
Best regards
Xaver Stiensmeier
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20231206/c81f2e93/attachment.htm>
More information about the slurm-users
mailing list