<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=utf-8"><meta name=Generator content="Microsoft Word 15 (filtered medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
span.EmailStyle18
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-GB link=blue vlink=purple style='word-wrap:break-word'><div class=WordSection1><p class=MsoNormal><span style='mso-fareast-language:EN-US'>If this is a single-host machine, I suggest checking the /etc/hosts file to make sure that ‘mannose’ is listed as you expect. It is generally advised to use FQDNs for host names; the fact that the message “</span>connection to host:mannose:6819: Connection refused<span style='mso-fareast-language:EN-US'>” used a short name may mean that you have a short name in a configuration file. Equally, the incoming connection may be coming not from the IP of ‘mannose’ but from localhost (127.0.0.1 if you are using only IPv4).<o:p></o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'><o:p> </o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'>You also have a cluster name that looks like an FQDN; you may want to change that to something else. The cluster name is, I think, an abstract name, whereas host names must belong to real nodes that are resolvable.<o:p></o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'><o:p> </o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'>You may also find information in /var/log/messages or /var/log/secure, if applicable to your Linux distro.<o:p></o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'><o:p> </o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'>I use Slurm with firewalld and it usually works fine.<o:p></o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'><o:p> </o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'>William<o:p></o:p></span></p><p class=MsoNormal><span style='mso-fareast-language:EN-US'><o:p> </o:p></span></p><div style='border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm'><p class=MsoNormal><b><span lang=EN-US>From:</span></b><span 
lang=EN-US> slurm-users &lt;slurm-users-bounces@lists.schedmd.com&gt; <b>On Behalf Of </b>Sushil Mishra<br><b>Sent:</b> 30 November 2022 22:44<br><b>To:</b> Slurm User Community List &lt;slurm-users@lists.schedmd.com&gt;<br><b>Subject:</b> [slurm-users] slurm_persist_conn_open_without_init: failed to open persistent connection to host<o:p></o:p></span></p></div><p class=MsoNormal><o:p> </o:p></p><div><p class=MsoNormal>Hi all,<o:p></o:p></p><div><p class=MsoNormal><o:p> </o:p></p></div><div><p class=MsoNormal>I installed Slurm and enabled accounting on a single-node machine, i.e. the same server is both the master and the compute node. I mainly followed the instructions on this page:<o:p></o:p></p></div><div><p class=MsoNormal><a href="https://southgreenplatform.github.io/trainings/hpc/slurminstallation/">https://southgreenplatform.github.io/trainings/hpc/slurminstallation/</a><o:p></o:p></p></div><div><p class=MsoNormal>After enabling accounting, I am having problems starting slurmctld.service. <o:p></o:p></p></div><div><p class=MsoNormal>[root@mannose sushil]# cat /var/log/slurm/slurmctld.log <br>[2022-11-30T16:32:15.194] Job accounting information stored, but details not gathered<br>[2022-11-30T16:32:15.195] slurmctld version 20.11.9 started on cluster <a href="http://mannose.olemiss.edu">mannose.olemiss.edu</a><br>[2022-11-30T16:32:15.201] error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:mannose:6819: Connection refused<br>[2022-11-30T16:32:15.201] error: Sending PersistInit msg: Connection refused<br>[2022-11-30T16:32:15.201] accounting_storage/slurmdbd: clusteracct_storage_p_register_ctld: Registering slurmctld at port 6817 with slurmdbd<br>[2022-11-30T16:32:15.203] error: Sending PersistInit msg: Connection refused<br>[2022-11-30T16:32:15.203] error: Association database appears down, reading from state file.<br>[2022-11-30T16:32:15.203] error: Unable to get any information from the state file<br>[2022-11-30T16:32:15.203] fatal: slurmdbd 
and/or database must be up at slurmctld start time<o:p></o:p></p></div><div><p class=MsoNormal><o:p> </o:p></p></div><div><p class=MsoNormal>It is not clear why port 6819 is being used when I have SlurmctldPort=6817 and SlurmdPort=6818 set in slurm.conf. Anyway, I opened all three ports (6817, 6818 and 6819) with commands of the form 'firewall-cmd --permanent --zone=public --add-port=6819/tcp'<o:p></o:p></p></div><div><p class=MsoNormal><o:p> </o:p></p></div><div><p class=MsoNormal>MariaDB [(none)]> show grants<o:p></o:p></p></div><div><p class=MsoNormal> -> ;<br>+--------------------------------------------------------------------------------------------------------------+<br>| Grants for slurm@localhost |<br>+--------------------------------------------------------------------------------------------------------------+<br>| GRANT USAGE ON *.* TO 'slurm'@'localhost' IDENTIFIED BY PASSWORD '*0E54A04D59B6C9F7B7B6269BE7F30AD3E3409895' |<br>| GRANT ALL PRIVILEGES ON `slurm_acct_db`.* TO 'slurm'@'localhost' WITH GRANT OPTION |<br>+--------------------------------------------------------------------------------------------------------------+<br>2 rows in set (0.00 sec)<br><br>MariaDB [(none)]> quit<o:p></o:p></p></div><div><p class=MsoNormal><o:p> </o:p></p></div><div><p class=MsoNormal>Can someone help me figure out what might be going wrong? <o:p></o:p></p></div><div><p class=MsoNormal><o:p> </o:p></p></div><div><p class=MsoNormal>Best,<o:p></o:p></p></div><div><p class=MsoNormal>SK<o:p></o:p></p></div><div><p class=MsoNormal><o:p> </o:p></p></div></div></div></body></html>