[slurm-users] unable to start slurmd process.
Riebs, Andy
andy.riebs at hpe.com
Thu Jun 11 12:34:21 UTC 2020
Navin,
As you can see, systemd provides very little service-specific information. For Slurm, you really need to go to the Slurm logs to find out what happened.
Hint: A quick way to identify problems like this with slurmd and slurmctld is to run them in the foreground with the “-Dvvv” option, which makes them log to your terminal and usually makes the problem immediately obvious.
For example,
# /usr/local/slurm/sbin/slurmd -Dvvv
Just hit ^C when you’re done, if necessary. Of course, if it doesn’t fail when you run it this way, it’s time to look elsewhere.
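If it helps, here is a minimal sketch of that foreground run wrapped in a shell function. It assumes slurmd lives at the path used in the example above and that the systemd unit is named slurmd.service (as in the log below); the function name debug_slurmd and the SLURMD variable are made up for illustration, so adjust them for your site.

```shell
#!/bin/sh
# Sketch only: run slurmd in the foreground with verbose logging.
# SLURMD defaults to the path from the example above (an assumption;
# override it if your install prefix differs).
SLURMD="${SLURMD:-/usr/local/slurm/sbin/slurmd}"

debug_slurmd() {
    # Bail out early if the binary is not where we assume it is.
    if [ ! -x "$SLURMD" ]; then
        echo "slurmd not found at $SLURMD" >&2
        return 1
    fi
    # Stop the systemd-managed instance first so the two do not conflict
    # (assumes the unit is named slurmd.service).
    systemctl stop slurmd.service
    # -D keeps slurmd in the foreground; each -v raises the verbosity.
    # Hit ^C to quit.
    "$SLURMD" -Dvvv
}
```

Run debug_slurmd as root on the affected node, and start the service again with systemctl once the problem is found.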
Andy
From: slurm-users [mailto:slurm-users-bounces at lists.schedmd.com] On Behalf Of navin srivastava
Sent: Thursday, June 11, 2020 8:25 AM
To: Slurm User Community List <slurm-users at lists.schedmd.com>
Subject: [slurm-users] unable to start slurmd process.
Hi Team,
When I try to start the slurmd process, I get the error below.
2020-06-11T13:11:58.652711+02:00 oled3 systemd[1]: Starting Slurm node daemon...
2020-06-11T13:13:28.683840+02:00 oled3 systemd[1]: slurmd.service: Start operation timed out. Terminating.
2020-06-11T13:13:28.684479+02:00 oled3 systemd[1]: Failed to start Slurm node daemon.
2020-06-11T13:13:28.684759+02:00 oled3 systemd[1]: slurmd.service: Unit entered failed state.
2020-06-11T13:13:28.684917+02:00 oled3 systemd[1]: slurmd.service: Failed with result 'timeout'.
2020-06-11T13:15:01.437172+02:00 oled3 cron[8094]: pam_unix(crond:session): session opened for user root by (uid=0)
The Slurm version is 17.11.8.
The server and Slurm have been running for a long time and we have not made any changes, but today when I try to start slurmd it gives this error message.
Any idea what could be wrong here?
Regards
Navin.