<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
        {mso-style-priority:34;
        margin-top:0cm;
        margin-right:0cm;
        margin-bottom:0cm;
        margin-left:36.0pt;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
        {page:WordSection1;}
/* List Definitions */
@list l0
        {mso-list-id:770591545;
        mso-list-template-ids:733897740;}
@list l0:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:36.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level2
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:72.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level3
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:108.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level4
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:144.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level5
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:180.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level6
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:216.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level7
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:252.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level8
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:288.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l0:level9
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:324.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l1
        {mso-list-id:857738487;
        mso-list-type:hybrid;
        mso-list-template-ids:336119376 67698689 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l1:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l1:level2
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l1:level3
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
@list l1:level4
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l1:level5
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l1:level6
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
@list l1:level7
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l1:level8
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l1:level9
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
@list l2
        {mso-list-id:1682930086;
        mso-list-template-ids:1939503450;}
@list l2:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:36.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level2
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:72.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level3
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:108.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level4
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:144.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level5
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:180.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level6
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:216.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level7
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:252.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level8
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:288.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l2:level9
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:324.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l3
        {mso-list-id:1845513158;
        mso-list-type:hybrid;
        mso-list-template-ids:2068076232 67698689 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l3:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l3:level2
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l3:level3
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
@list l3:level4
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l3:level5
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l3:level6
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
@list l3:level7
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Symbol;}
@list l3:level8
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:"Courier New";}
@list l3:level9
        {mso-level-number-format:bullet;
        mso-level-text:\F0A7 ;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        font-family:Wingdings;}
ol
        {margin-bottom:0cm;}
ul
        {margin-bottom:0cm;}
--></style>
</head>
<body lang="EN-CA" link="blue" vlink="purple" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal">A few things to check here:<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<ul style="margin-top:0cm" type="disc">
<li class="MsoListParagraph" style="margin-left:0cm;mso-list:l3 level1 lfo3">Ensure that your firewall ports are open – ports 6817/6818/6819/3306<o:p></o:p></li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l3 level1 lfo3">Make sure that munge is working correctly:<o:p></o:p></li></ul>
<p class="MsoNormal" style="margin-left:72.0pt"><span style="font-family:"Courier New"">$ munge -n | unmunge<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<ul style="margin-top:0cm" type="disc">
<li class="MsoListParagraph" style="margin-left:0cm;mso-list:l1 level1 lfo6">Make sure you go through the accounting web-page as well -
<a href="https://slurm.schedmd.com/accounting.html">https://slurm.schedmd.com/accounting.html</a><o:p></o:p></li><ul style="margin-top:0cm" type="circle">
<li class="MsoListParagraph" style="margin-left:0cm;mso-list:l1 level2 lfo6">In particular, ensure that you can connect to the MySQL server, create the slurm user within MySQL database, give it the required permissions, etc,  Go through the “Live example” on
 the accounting web-page.<o:p></o:p></li></ul>
<li class="MsoListParagraph" style="margin-left:0cm;mso-list:l1 level1 lfo6">Walk through your log files – especially the slurmdbd.log file and clear up all errors.<o:p></o:p></li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l1 level1 lfo6">As a general comment, put in the fewest number of configuration options into your slurm.conf and slurmdbd.conf file as possible – use the defaults when you can.  Add items incrementally
 and carefully so you can back-out easily when you make mistakes (and you will!)<o:p></o:p></li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l1 level1 lfo6">In my slurm.conf, I also have specified the AccountingStorageHost, AccountingStorageUser and AccountingStoragePort – not sure if I need any of these though…<o:p></o:p></li></ul>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Mike<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-bottom:12.0pt"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">slurm-users <slurm-users-bounces@lists.schedmd.com> on behalf of slurm-users-request@lists.schedmd.com <slurm-users-request@lists.schedmd.com><br>
<b>Date: </b>Tuesday, February 2, 2021 at 8:16 AM<br>
<b>To: </b>slurm-users@lists.schedmd.com <slurm-users@lists.schedmd.com><br>
<b>Subject: </b>slurm-users Digest, Vol 40, Issue 4<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal">Send slurm-users mailing list submissions to<br>
        slurm-users@lists.schedmd.com<br>
<br>
To subscribe or unsubscribe via the World Wide Web, visit<br>
        <a href="https://lists.schedmd.com/cgi-bin/mailman/listinfo/slurm-users">
https://lists.schedmd.com/cgi-bin/mailman/listinfo/slurm-users</a><br>
or, via email, send a message with subject or body 'help' to<br>
        slurm-users-request@lists.schedmd.com<br>
<br>
You can reach the person managing the list at<br>
        slurm-users-owner@lists.schedmd.com<br>
<br>
When replying, please edit your Subject line so it is more specific<br>
than "Re: Contents of slurm-users digest..."<br>
<br>
<br>
Today's Topics:<br>
<br>
   1. Slurm - sacct: error: slurm_persist_conn_open_without_init:<br>
      failed to open persistent connection to host:localhost:6819:<br>
      Connection refused (Zainul Abiddin)<br>
   2. Re: Slurm - Munge configuration details (Benson Muite)<br>
<br>
<br>
----------------------------------------------------------------------<br>
<br>
Message: 1<br>
Date: Tue, 2 Feb 2021 18:35:20 +0530<br>
From: Zainul Abiddin <zainul1114@gmail.com><br>
To: slurm-users@lists.schedmd.com<br>
Subject: [slurm-users] Slurm - sacct: error:<br>
        slurm_persist_conn_open_without_init: failed to open persistent<br>
        connection to host:localhost:6819: Connection refused<br>
Message-ID:<br>
        <CAA9R82u0L7VdZDhvP_1KfWmVrLL-Cc5VhAVr2SgTuwN_1AXuUA@mail.gmail.com><br>
Content-Type: text/plain; charset="utf-8"<br>
<br>
Hi All,<br>
I have done slurmdbd configuration and while i am trying to run account<br>
manager with *sacct* i am getting below error.<br>
<br>
[root@smaster ~]# sacct<br>
sacct: error: slurm_persist_conn_open_without_init: failed to open<br>
persistent connection to host:localhost:6819: Connection refused<br>
sacct: error: Sending PersistInit msg: Connection refused<br>
sacct: error: Problem talking to the database: Connection refused<br>
[root@smaster ~]#<br>
<br>
My slurmdbd configuration :<br>
[root@smaster ~]# cat /etc/slurm/slurmdbd.conf<br>
AuthType=auth/munge<br>
DbdAddr=localhost<br>
DbdHost=localhost<br>
SlurmUser=slurm<br>
DebugLevel=4<br>
LogFile=/var/log/slurm/slurmdbd.log<br>
PidFile=/var/run/slurmdbd.pid<br>
StorageType=accounting_storage/mysql<br>
StorageHost=localhost<br>
StoragePass=password<br>
StorageUser=slurm<br>
StorageLoc=slurm_acct_db<br>
<br>
[root@smaster ~]# chown slurm: /etc/slurm/slurmdbd.conf<br>
[root@smaster ~]# chmod 600 /etc/slurm/slurmdbd.conf<br>
[root@smaster ~]# mkdir /var/log/slurm<br>
[root@smaster ~]# touch /var/log/slurm/slurmdbd.log<br>
[root@smaster ~]# chown slurm: /var/log/slurm/slurmdbd.log<br>
[root@smaster ~]# scontrol show config | grep AccountingStorageHost<br>
AccountingStorageHost   = localhost<br>
<br>
Note:<br>
i have edited file /etc/slurm/slurm.conf and modified the below line<br>
# LOGGING AND ACCOUNTING<br>
AccountingStorageType=accounting_storage/slurmdbd<br>
Then restarted all the services<br>
<br>
[root@smaster ~]# for i in munge slurmd slurmctld slurmdbd; do service $i<br>
status; done<br>
Redirecting to /bin/systemctl status munge.service<br>
? munge.service - MUNGE authentication service<br>
   Loaded: loaded (/usr/lib/systemd/system/munge.service; enabled; vendor<br>
preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:10 IST; 3h 36min ago<br>
     Docs: man:munged(8)<br>
 Main PID: 20613 (munged)<br>
   CGroup: /system.slice/munge.service<br>
           ??20613 /usr/sbin/munged<br>
<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Stopped MUNGE<br>
authentication service.<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Starting MUNGE<br>
authentication service...<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Started MUNGE<br>
authentication service.<br>
Redirecting to /bin/systemctl status slurmd.service<br>
? slurmd.service - Slurm node daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmd.service; enabled; vendor<br>
preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:10 IST; 3h 36min ago<br>
 Main PID: 20637 (slurmd)<br>
   CGroup: /system.slice/slurmd.service<br>
           ??20637 /usr/sbin/slurmd -D<br>
<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Started Slurm node<br>
daemon.<br>
Feb 02 15:30:47 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 7 for UID 0<br>
Feb 02 15:31:46 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 8 for UID 0<br>
Feb 02 15:33:43 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 9 for UID 0<br>
<br>
Redirecting to /bin/systemctl status slurmctld.service<br>
? slurmctld.service - Slurm controller daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled;<br>
vendor preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:11 IST; 3h 36min ago<br>
 Main PID: 20660 (slurmctld)<br>
   CGroup: /system.slice/slurmctld.service<br>
           ??20660 /usr/sbin/slurmctld -D<br>
<br>
Feb 02 13:21:11 smaster.calligotech.com systemd[1]: Started Slurm<br>
controller daemon.<br>
Redirecting to /bin/systemctl status slurmdbd.service<br>
? slurmdbd.service - Slurm DBD accounting daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmdbd.service; enabled;<br>
vendor preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 16:29:11 IST; 28min ago<br>
 Main PID: 24146 (slurmdbd)<br>
   CGroup: /system.slice/slurmdbd.service<br>
           ??24146 /usr/sbin/slurmdbd -D<br>
<br>
Feb 02 16:29:11 smaster.calligotech.com systemd[1]: Started Slurm DBD<br>
accounting daemon.<br>
[root@smaster ~]# srun --ntasks=2 --label /bin/hostname<br>
srun: job 22 queued and waiting for resources<br>
srun: job 22 has been allocated resources<br>
1: smaster.calligotech.com<br>
0: smaster.calligotech.com<br>
[root@smaster ~]#<br>
<br>
<br>
However when i run the below command<br>
<br>
[root@smaster ~]# sacct<br>
sacct: error: slurm_persist_conn_open_without_init: failed to open<br>
persistent connection to host:localhost:6819: Connection refused<br>
sacct: error: Sending PersistInit msg: Connection refused<br>
sacct: error: Problem talking to the database: Connection refused<br>
[root@smaster ~]#<br>
<br>
and i have troubleshooted below steps<br>
<br>
[root@smaster ~]# telnet localhost 6819<br>
Trying ::1...<br>
telnet: connect to address ::1: Connection refused<br>
Trying 127.0.0.1...<br>
telnet: connect to address 127.0.0.1: Connection refused<br>
[root@smaster ~]#<br>
<br>
[root@smaster ~]# mysql -p -u slurm slurm_acct_db<br>
Enter password:<br>
Welcome to the MariaDB monitor.  Commands end with ; or \g.<br>
Your MariaDB connection id is 9<br>
Server version: 10.1.48-MariaDB MariaDB Server<br>
<br>
Copyright (c) 2000, 2018, Oracle, MariaDB Corporation Ab and others.<br>
<br>
Type 'help;' or '\h' for help. Type '\c' to clear the current input<br>
statement.<br>
<br>
MariaDB [slurm_acct_db]> show tables;<br>
Empty set (0.00 sec)<br>
<br>
MariaDB [slurm_acct_db]><br>
<br>
Then i have added DBPort and restarted services<br>
[root@smaster ~]# cat /etc/slurm/slurmdbd.conf<br>
AuthType=auth/munge<br>
DbdAddr=localhost<br>
DbdHost=localhost<br>
*DbdPort=6819*<br>
SlurmUser=slurm<br>
DebugLevel=4<br>
LogFile=/var/log/slurm/slurmdbd.log<br>
PidFile=/var/run/slurmdbd.pid<br>
StorageType=accounting_storage/mysql<br>
StorageHost=localhost<br>
StoragePass=password<br>
StorageUser=slurm<br>
StorageLoc=slurm_acct_db<br>
[root@smaster ~]#<br>
<br>
[root@smaster ~]# for i in munge slurmd slurmctld slurmdbd; do service $i<br>
status; done<br>
Redirecting to /bin/systemctl status munge.service<br>
? munge.service - MUNGE authentication service<br>
   Loaded: loaded (/usr/lib/systemd/system/munge.service; enabled; vendor<br>
preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:10 IST; 3h 55min ago<br>
     Docs: man:munged(8)<br>
 Main PID: 20613 (munged)<br>
   CGroup: /system.slice/munge.service<br>
           ??20613 /usr/sbin/munged<br>
<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Stopped MUNGE<br>
authentication service.<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Starting MUNGE<br>
authentication service...<br>
Feb 02 13:21:10 smaster.calligotech.com systemd[1]: Started MUNGE<br>
authentication service.<br>
Redirecting to /bin/systemctl status slurmd.service<br>
? slurmd.service - Slurm node daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmd.service; enabled; vendor<br>
preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:10 IST; 3h 55min ago<br>
 Main PID: 20637 (slurmd)<br>
   CGroup: /system.slice/slurmd.service<br>
           ??20637 /usr/sbin/slurmd -D<br>
<br>
Feb 02 15:30:47 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 7 for UID 0<br>
Feb 02 15:31:46 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 8 for UID 0<br>
Feb 02 15:33:43 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 9 for UID 0<br>
Feb 02 15:38:45 smaster.calligotech.com slurmd[20637]: slurmd: Launching<br>
batch job 12 for UID 0<br>
<br>
Redirecting to /bin/systemctl status slurmctld.service<br>
? slurmctld.service - Slurm controller daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled;<br>
vendor preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 13:21:11 IST; 3h 55min ago<br>
 Main PID: 20660 (slurmctld)<br>
   CGroup: /system.slice/slurmctld.service<br>
           ??20660 /usr/sbin/slurmctld -D<br>
<br>
Feb 02 13:21:11 smaster.calligotech.com systemd[1]: Started Slurm<br>
controller daemon.<br>
Redirecting to /bin/systemctl status slurmdbd.service<br>
? slurmdbd.service - Slurm DBD accounting daemon<br>
   Loaded: loaded (/usr/lib/systemd/system/slurmdbd.service; enabled;<br>
vendor preset: disabled)<br>
   Active: active (running) since Tue 2021-02-02 16:29:11 IST; 47min ago<br>
 Main PID: 24146 (slurmdbd)<br>
   CGroup: /system.slice/slurmdbd.service<br>
           ??24146 /usr/sbin/slurmdbd -D<br>
<br>
Feb 02 16:29:11 smaster.calligotech.com systemd[1]: Started Slurm DBD<br>
accounting daemon.<br>
[root@smaster ~]# ps -ef |grep slurm<br>
root     20637     1  0 13:21 ?        00:00:00 /usr/sbin/slurmd -D<br>
slurm    20660     1  0 13:21 ?        00:00:08 /usr/sbin/slurmctld -D<br>
root     24146     1  0 16:29 ?        00:00:00 /usr/sbin/slurmdbd -D<br>
root     25395 18378  0 17:17 pts/2    00:00:00 grep --color=auto slurm<br>
[root@smaster ~]# sacct<br>
sacct: error: slurm_persist_conn_open_without_init: failed to open<br>
persistent connection to host:localhost:6819: Connection refused<br>
sacct: error: Sending PersistInit msg: Connection refused<br>
sacct: error: Problem talking to the database: Connection refused<br>
[root@smaster ~]#<br>
<br>
[root@smaster ~]# tail /var/log/slurm/slurmdbd.log<br>
[2021-02-02T17:16:01.913] error: mysql_real_connect failed: 2005 Unknown<br>
MySQL server host 'smater' (-2)<br>
[2021-02-02T17:16:01.913] error: The database must be up when starting the<br>
MYSQL plugin.  Trying again in 5 seconds.<br>
[2021-02-02T17:16:06.963] error: mysql_real_connect failed: 2005 Unknown<br>
MySQL server host 'smater' (-2)<br>
[2021-02-02T17:16:06.963] error: The database must be up when starting the<br>
MYSQL plugin.  Trying again in 5 seconds.<br>
[2021-02-02T17:16:12.083] error: mysql_real_connect failed: 2005 Unknown<br>
MySQL server host 'smater' (-2)<br>
[2021-02-02T17:16:12.083] error: The database must be up when starting the<br>
MYSQL plugin.  Trying again in 5 seconds.<br>
[2021-02-02T17:16:17.140] error: mysql_real_connect failed: 2005 Unknown<br>
MySQL server host 'smater' (-2)<br>
[2021-02-02T17:16:17.141] error: The database must be up when starting the<br>
MYSQL plugin.  Trying again in 5 seconds.<br>
[2021-02-02T17:16:22.804] error: mysql_real_connect failed: 2005 Unknown<br>
MySQL server host 'smater' (-2)<br>
[2021-02-02T17:16:22.804] error: The database must be up when starting the<br>
MYSQL plugin.  Trying again in 5 seconds.<br>
[root@smaster ~]#<br>
<br>
Still the problem remains the same. Please help me to resolve this issue.<br>
<br>
Regards,<br>
Zain<br>
-------------- next part --------------<br>
An HTML attachment was scrubbed...<br>
URL: <<a href="http://lists.schedmd.com/pipermail/slurm-users/attachments/20210202/f2348489/attachment-0001.htm">http://lists.schedmd.com/pipermail/slurm-users/attachments/20210202/f2348489/attachment-0001.htm</a>><br>
<br>
------------------------------<br>
<br>
Message: 2<br>
Date: Tue, 2 Feb 2021 16:16:09 +0300<br>
From: Benson Muite <benson_muite@emailplus.org><br>
To: slurm-users@lists.schedmd.com<br>
Subject: Re: [slurm-users] Slurm - Munge configuration details<br>
Message-ID: <bd36d545-4fd7-05ec-4a51-bb2743258b34@emailplus.org><br>
Content-Type: text/plain; charset=utf-8; format=flowed<br>
<br>
On 2/2/21 4:00 PM, Zainul Abiddin wrote:<br>
> Hi Benson,<br>
> <br>
> I am not able to do passwordless ssh? between master and compute nodes <br>
> using Munge service.<br>
> when i am running below command , here it is asking for a password for <br>
> the compute node.<br>
> <br>
> /Am I configuring properly or not, so I need clarity on this?/<br>
> <br>
> [root@smaster ~]# munge -n | ssh snode unmunge<br>
> root@snode's password:<br>
> STATUS: ? ? ? ? ? Success (0)<br>
> ENCODE_HOST: smaster.calligotech.com <br>
> <<a href="http://smaster.calligotech.com/%3e?(192.168.1.195">http://smaster.calligotech.com/>?(192.168.1.195</a>)<br>
> ENCODE_TIME: ? ? ?2021-02-01 13:58:16 +0530 (1612168096)<br>
> DECODE_TIME: ? ? ?2021-02-01 13:58:21 +0530 (1612168101)<br>
> TTL: ? ? ? ? ? ? ?300<br>
> CIPHER: ? ? ? ? ? aes128 (4)<br>
> MAC: ? ? ? ? ? ? ?sha1 (3)<br>
> ZIP: ? ? ? ? ? ? ?none (0)<br>
> UID: ? ? ? ? ? ? ?root (0)<br>
> GID: ? ? ? ? ? ? ?root (0)<br>
> LENGTH: ? ? ? ? ? 0<br>
> <br>
> [root@smaster ~]#<br>
> <br>
> Regards,<br>
> Zain<br>
> <br>
Hi Zain,<br>
<br>
Perhaps try using the ipaddress instead of the hostname?<br>
<br>
Also, are clocks synchronized? See<br>
<a href="https://slurm.schedmd.com/quickstart_admin.html">https://slurm.schedmd.com/quickstart_admin.html</a><br>
Benson<br>
<br>
<br>
<br>
End of slurm-users Digest, Vol 40, Issue 4<br>
******************************************<o:p></o:p></p>
</div>
</div>
</body>
</html>