I fixed the firewall issue. The error is now: error: Node fgcu-compute04 appears to have a different slurm.
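The error text above is cut off, so what follows rests on an assumption: that it refers to the node's slurm.conf differing from the controller's copy. Under that assumption, a minimal sketch for comparing two copies of the file byte for byte could look like this. The function names and file paths are hypothetical, and the SHA-256 digest is only a convenient way to compare files; it is not the hash Slurm computes internally.

```python
import hashlib
from pathlib import Path


def file_digest(path):
    """Return the SHA-256 hex digest of a file's raw bytes."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def same_config(path_a, path_b):
    """Return True when both files are byte-identical."""
    return file_digest(path_a) == file_digest(path_b)


# Hypothetical usage, after copying the compute node's file to the head node:
#   same_config("/etc/slurm/slurm.conf", "/tmp/compute04-slurm.conf")
```

If the copies differ, even by whitespace or a comment, this check reports a mismatch, so a plain diff of the two files is the natural next step.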
Nilesh Dhumal
Associate Professor of Chemistry,
http://faculty.fgcu.edu/ndhumal/
Coordinator, FGCU Computational Facility,
https://www.fgcu.edu/cas/facultyresources/computationalfacility/
SH-430; Department of Chemistry and Physics
Florida Gulf Coast University
10501 FGCU Boulevard South
Fort Myers, FL 33965-6565
Phone: (239) 745-4394
Email: ndhumal@fgcu.edu
________________________________
From: Renfro, Michael <Renfro@tntech.edu>
Sent: Saturday, September 20, 2025 12:21 PM
To: Dhumal, Dr. Nilesh <ndhumal@fgcu.edu>; Slurm User Community List <slurm-users@lists.schedmd.com>
Subject: Re: Compute node not responding
Level II: Internal
slurmdbd is not a requirement to get things started [1], but you'll probably want it later.
It’s possible you’ve got host-based firewall rules on either system that are blocking communication. If you’re using firewalld, ufw, or something similar, stop their services, restart the slurmd and slurmctld services, and see if that helps.
[1] https://slurm.schedmd.com/quickstart_admin.html#dbd
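A successful ping only shows that ICMP traffic gets through; it does not show that the Slurm TCP ports are reachable, which is what a host-based firewall would typically block. The following is a minimal sketch of a TCP reachability check. The function name and the hostname in the usage comment are hypothetical, and the port numbers are Slurm's defaults (6817 for slurmctld, 6818 for slurmd), which a site may have overridden in slurm.conf.

```python
import socket


def tcp_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


# Hypothetical usage, run on the compute node against the head node:
#   tcp_port_open("head-node", 6817)       # default SlurmctldPort
# and on the head node against the compute node:
#   tcp_port_open("fgcu-compute04", 6818)  # default SlurmdPort
```

A False result while ping works points toward a firewall rule or a daemon that is not listening, rather than a basic network problem.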
From: Dhumal, Dr. Nilesh via slurm-users <slurm-users@lists.schedmd.com>
Date: Saturday, September 20, 2025 at 11:02 AM
To: Slurm User Community List <slurm-users@lists.schedmd.com>
Subject: [slurm-users] Compute node not responding
________________________________
Hello,
Recently, we installed Slurm 25 on our cluster. We are not tracking user accounts, and we didn't configure the SQL database on the head node. We are running slurmctld on the head node and slurmd on the compute node. We are getting the following errors:
Head node: compute node not responding.
Compute node: [2025-09-19T15:30:23.461] error: Unable to register: Unable to contact slurm controller (connect failure)
Do we need to run slurmdbd on the head node? I checked the network connection by pinging the compute node from the head node. Do you have any suggestions to resolve this issue?
Thanks,
Nilesh