[slurm-users] Slurm node history / log ?

Roberto Monti Roberto.PolverelliMonti at jax.org
Wed Jul 5 17:27:49 UTC 2023

Hi Bill,
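Your best bet is probably the slurmctld log on the server that is acting as the active controller. It is often /var/log/slurmctld, but the actual path is whatever SlurmctldLogFile is set to in slurm.conf.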
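
As a rough illustration of pulling one node's entries out of that log, here is a minimal Python sketch. The function names (node_history, print_node_history) are made up for this example, and it assumes nothing about the log format beyond the node name appearing literally in the relevant lines. Entries that refer to the node only through a hostlist range such as node[01-04] would be missed.

```python
import re
from typing import Iterable, Iterator


def node_history(lines: Iterable[str], node: str) -> Iterator[str]:
    """Yield the lines that mention `node` as a standalone name.

    The lookarounds keep "node01" from matching inside "node010"
    or "node01-ib". This is a plain text filter, not a parser.
    """
    pattern = re.compile(rf"(?<![\w-]){re.escape(node)}(?![\w-])")
    for line in lines:
        if pattern.search(line):
            yield line.rstrip("\n")


def print_node_history(log_path: str, node: str) -> None:
    """Print matching lines from a log file (path supplied by the caller)."""
    # errors="replace" avoids crashing on stray non-UTF-8 bytes in the log.
    with open(log_path, errors="replace") as fh:
        for entry in node_history(fh, node):
            print(entry)
```

Calling print_node_history with the path from SlurmctldLogFile and a node name such as "node01" would list the matching lines in file order. Reading that log usually needs root or slurm-user permissions on the controller.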


Roberto P. Monti
DevOps Engineer I
roberto.monti at jax.org

The Jackson Laboratory
United States | China | Japan

From: slurm-users <slurm-users-bounces at lists.schedmd.com> On Behalf Of Bill Benedetto
Sent: Wednesday, July 5, 2023 1:21 PM
To: slurm-users at lists.schedmd.com
Subject: [EXTERNAL][slurm-users] Slurm node history / log ?

Good day.

Is there some command that I can use in Slurm to see a node's history?

Not the job history, but the state history.

Something like:
Jul  5 13:11:01 node01 taken offline by slurmctld because node01 not responding

Jul  5 13:11:01 node01 taken offline by USER1 state=DRAIN reason="System acting up, going to reboot"

Jul  5 13:11:01 node01 online by USER1

My goal/idea is to see if a node has been having problems according to Slurm itself.
Or if someone DOWNed a node for some reason.
Or to see if a node was down and just returned to service recently.

Does anything like that already exist in Slurm?


- Bill
Bill Benedetto     bbenedetto at goodyear.com    The Goodyear Tire & Rubber Co.
I don't speak for Goodyear and they don't speak for me.  We're both happy.
