[slurm-users] Slurm node history / log ?
Roberto.PolverelliMonti at jax.org
Wed Jul 5 17:27:49 UTC 2023
Your best bet is probably /var/log/slurmctld on the server that is acting as the active controller.
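As a rough sketch of how you might pull one node's entries out of that log, something like the following could work. The helper name node_lines is made up for illustration, and the real log location is whatever SlurmctldLogFile points to in your slurm.conf, so adjust the path to match. It only does a plain whole-word match on the node name, so it will miss lines where slurmctld writes a bracketed host range such as node[01-04].

```python
import re
from typing import Iterable, List


def node_lines(lines: Iterable[str], node: str) -> List[str]:
    """Return the lines that mention `node` as a whole word.

    `node_lines` is a hypothetical helper, not part of Slurm.
    The lookarounds keep "node01" from matching "node010" or "xnode01".
    """
    pattern = re.compile(r"(?<![\w-])" + re.escape(node) + r"(?![\w-])")
    return [line.rstrip("\n") for line in lines if pattern.search(line)]


# Example use (path is an assumption; check SlurmctldLogFile in slurm.conf):
#
#     with open("/var/log/slurmctld", errors="replace") as fh:
#         for line in node_lines(fh, "node01"):
#             print(line)
```

This needs read access to the log on the controller. If you have accounting through slurmdbd set up, I believe `sacctmgr show event` also records node state changes, which may be closer to a real history.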
Roberto P. Monti
DevOps Engineer I
roberto.monti at jax.org
The Jackson Laboratory
United States | China | Japan
From: slurm-users <slurm-users-bounces at lists.schedmd.com> On Behalf Of Bill Benedetto
Sent: Wednesday, July 5, 2023 1:21 PM
To: slurm-users at lists.schedmd.com
Subject: [EXTERNAL][slurm-users] Slurm node history / log ?
Is there some command that I can use in Slurm to see a node's history?
Not the job history, but the state history. Something like:
Jul 5 13:11:01 node01 taken offline by slurmctld because node01 not responding
Jul 5 13:11:01 node01 taken offline by USER1 state=DRAIN reason="System acting up, going to reboot"
Jul 5 13:11:01 node01 online by USER1
My goal/idea is to see if a node has been having problems according to Slurm itself.
Or if someone DOWNed a node for some reason.
Or to see if a node was down and just returned to service recently.
Does anything like that already exist in Slurm?
Bill Benedetto
bbenedetto at goodyear.com
The Goodyear Tire & Rubber Co.
I don't speak for Goodyear and they don't speak for me. We're both happy.