[slurm-users] GPU-node not waking up after power-save

Ümit Seren uemit.seren at gmail.com
Thu Oct 13 07:43:18 UTC 2022


We use power saving with our GPU nodes and they power up fine. They take a bit longer to boot but that’s it.
What do you mean with not waking up ?
The power on script is not called ?
Best
Ümit

From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Loris Bennett <loris.bennett at fu-berlin.de>
Date: Thursday, 13. October 2022 at 08:14
To: Slurm Users Mailing List <slurm-users at lists.schedmd.com>
Subject: [slurm-users] GPU-node not waking up after power-save
Hi,

We use Slurm's power saving mechanism to switch of idle nodes.  However,
we don't currently use it for our GPU nodes.  This is because in the
past these nodes failed to wake up again when jobs were submitted to the
GPU partition.  Before we look at the issue due to the current energy
situation, I was wondering whether this a problem others have (had).

So does power-saving work in general for GPU nodes and, if so, are there
any extra steps one needs to take in order to set things up properly?

Cheers,

Loris

--
Dr. Loris Bennett (Herr/Mr)
ZEDAT, Freie Universität Berlin         Email loris.bennett at fu-berlin.de
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20221013/e0b0b160/attachment.htm>


More information about the slurm-users mailing list