Hi Maarten,

On 4/23/26 09:33, Pols, Maarten via slurm-users wrote:
> Last Friday, we experienced a power outage which required us to restart the server. After the restart, we were unable to log in to the master node. Eventually, we managed to access the system via a safe mode workaround. Through a process of elimination, we identified the slurmdbd and slurmctld services as the root cause of the issue. Would you happen to have any idea what might have caused this behavior? We have since upgraded to version 25.11.5, which appears to be running smoothly. However, we would still like to understand the underlying cause of the problem.

Console or SSH login to a server should not be related in any way to the slurmctld/slurmdbd daemons. Maybe one of the server's filesystems had become full? A full root (/) or /tmp filesystem could prevent logins on any Linux system, because files need to be written to /tmp.

IHTH,
Ole

-- 
Ole Holm Nielsen
PhD, Senior HPC Officer
Department of Physics, Technical University of Denmark
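
The full-filesystem check suggested above could be sketched in Python roughly as follows. This is only an illustration under assumptions: the helper name `usage_report`, the 95% threshold, and the inclusion of /var in the path list are hypothetical choices, not anything taken from the original report. The figures may also differ slightly from what `df` prints, since `df` accounts for reserved blocks.

```python
import os
import shutil


def usage_report(paths=("/", "/tmp", "/var"), threshold=0.95):
    """Return (path, fraction_used, looks_full) for each path that exists.

    The default paths and the threshold are illustrative assumptions.
    """
    report = []
    for path in paths:
        if not os.path.exists(path):
            continue  # skip mount points that are absent on this host
        usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
        fraction = usage.used / usage.total if usage.total else 0.0
        report.append((path, fraction, fraction >= threshold))
    return report


if __name__ == "__main__":
    for path, fraction, looks_full in usage_report():
        flag = "FULL?" if looks_full else "ok"
        print(f"{path:<8} {fraction:6.1%} used  {flag}")
```

A filesystem flagged here would be a candidate explanation for the failed logins. The sketch only looks at byte usage and does not check inode exhaustion.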