[slurm-users] Slurm node history / log ?
Bill Benedetto
bbenedetto at goodyear.com
Wed Jul 5 17:20:46 UTC 2023
Good day.
Is there some command that I can use in Slurm to see a node's history?
Not the job history, but the state history.
Something like:
Jul 5 13:11:01 node01 taken offline by slurmctld because node01 not responding
And/Or:
Jul 5 13:11:01 node01 taken offline by USER1 state=DRAIN reason="System acting up, going to reboot"
And/Or:
Jul 5 13:11:01 node01 online by USER1
My goal/idea is to see if a node has been having problems according to Slurm itself.
Or if someone DOWNed a node for some reason.
Or to see if a node was down and just returned to service recently.
Does anything like that already exist in Slurm?
Thanks!
- Bill
+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+
Bill Benedetto bbenedetto at goodyear.com<mailto:bbenedetto at goodyear.com> The Goodyear Tire & Rubber Co.
I don't speak for Goodyear and they don't speak for me. We're both happy.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20230705/ac74e251/attachment-0001.htm>
More information about the slurm-users
mailing list