Thanks Ole,
this is very helpful. I was unaware of that issue. From the bug report it's not clear to me whether it was just an sreport (display) issue, or whether the problem was in the way the data was stored.

In fact, I am running 23.11.5, which I installed in April. The numbers I see for the last few months (including April) are fine. The earlier numbers (from when I was running an earlier version) are the ones affected by this problem. So if the issue was in the way the data was stored, that explains it, and I can live with it (even if I can't provide an accurate report for my management now), knowing that the problem won't happen again in the future.

Thanks and have a great weekend

On Fri, Aug 23, 2024 at 8:00 AM Ole Holm Nielsen via slurm-users <slurm-users@lists.schedmd.com> wrote:
Hi Davide,

On 8/22/24 21:30, Davide DelVento via slurm-users wrote:
> I am confused by the reported amount of Down and PLND Down by sreport.
> According to it, our cluster would have had a significant amount of
> downtime, which I know didn't happen (or, according to the documentation
> "time that slurmctld was not responding", see
> https://slurm.schedmd.com/sreport.html)
>
> Could it be my purge settings causing this problem? How can I check (maybe
> in some logs, maybe in the future) if actually slurmctld was not
> responding? The expected long-term numbers should be less than the ones
> reported for last month when we had an issue with a few nodes....

Which version of Slurm are you using?  There was an sreport bug that
should be fixed in 23.11: https://support.schedmd.com/show_bug.cgi?id=17689

/Ole


