<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">I was struggling like crazy with this one a while ago.<div class="">Then I saw this in the slurm.conf man page:</div><div class=""><br class=""></div><div class="">AccountingStoragePass</div><div class=""><div class="">The password used to gain access to the database to store the accounting data. Only used for database type storage plugins, ignored otherwise. In the case of Slurm DBD (Database Daemon) with MUNGE authentication this can be configured to use a MUNGE daemon specifically configured to provide authentication between clusters while the default MUNGE daemon provides authentication within a cluster. In that case, AccountingStoragePass should specify the named port to be used for communications with the alternate MUNGE daemon (e.g. "/var/run/munge/global.socket.2"). The default value is NULL. Also see DefaultStoragePass.</div></div><div class=""><br class=""></div><div class="">So if you are using MUNGE, leave AccountingStoragePass out of slurm.conf: when it is unset, the default MUNGE socket is used. If it is set to the database password, slurmctld tries to open that string as a MUNGE socket, which matches the "Failed to access" error in your log. The database password belongs only in slurmdbd.conf (as StoragePass).</div><div class=""><br class=""></div><div class="">Cheers,</div><div class="">Barbara</div><div class=""><br class=""><div><blockquote type="cite" class=""><div class="">On 29 Nov 2017, at 14:28, Andy Riebs &lt;<a href="mailto:andy.riebs@hpe.com" class="">andy.riebs@hpe.com</a>&gt; wrote:</div><br class="Apple-interchange-newline"><div class="">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" class="">
<div bgcolor="#FFFFFF" text="#000000" class="">
It looks like you don't have the munged daemon running.<br class="">
<br class="">
<div class="moz-cite-prefix">On 11/29/2017 08:01 AM, Bruno Santos
wrote:<br class="">
</div>
<blockquote type="cite" cite="mid:CANUurQoD-+8i3yM8WqXM1epdLTQYWsrRH52M1hXaan9m3Pe23Q@mail.gmail.com" class="">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" class="">
<div dir="ltr" class="">Hi everyone,
<div class=""><br class="">
</div>
<div class="">I have set-up slurm to use slurm_db and all was working
fine. However I had to change the slurm.conf to play with user
priority and upon restarting the slurmctl is fails with the
following messages below. It seems that somehow is trying to
use the mysql password as a munge socket? </div>
<div class="">Any idea how to solve it? </div>
<div class=""> </div>
<div class="">
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">Nov 29 12:56:30 plantae
slurmctld[29613]: Registering slurmctld at port 6817 with
slurmdbd.<br class="">
Nov 29 12:56:32 plantae slurmctld[29613]: error: If munged
is up, restart with --num-threads=10<br class="">
Nov 29 12:56:32 plantae slurmctld[29613]: error: Munge
encode failed: Failed to access "magic": No such file or
directory<br class="">
Nov 29 12:56:32 plantae slurmctld[29613]: error:
authentication: Socket communication error<br class="">
Nov 29 12:56:32 plantae slurmctld[29613]: error:
slurm_persist_conn_open: failed to send persistent
connection init message to localhost:6819<br class="">
Nov 29 12:56:32 plantae slurmctld[29613]: error: slurmdbd:
Sending PersistInit msg: Protocol authentication error<br class="">
Nov 29 12:56:34 plantae slurmctld[29613]: error: If munged
is up, restart with --num-threads=10<br class="">
Nov 29 12:56:34 plantae slurmctld[29613]: error: Munge
encode failed: Failed to access "magic": No such file or
directory<br class="">
Nov 29 12:56:34 plantae slurmctld[29613]: error:
authentication: Socket communication error<br class="">
Nov 29 12:56:34 plantae slurmctld[29613]: error:
slurm_persist_conn_open: failed to send persistent
connection init message to localhost:6819<br class="">
Nov 29 12:56:34 plantae slurmctld[29613]: error: slurmdbd:
Sending PersistInit msg: Protocol authentication error<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: error: If munged
is up, restart with --num-threads=10<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: error: Munge
encode failed: Failed to access "magic": No such file or
directory<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: error:
authentication: Socket communication error<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: error:
slurm_persist_conn_open: failed to send persistent
connection init message to localhost:6819<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: error: slurmdbd:
Sending PersistInit msg: Protocol authentication error<br class="">
Nov 29 12:56:36 plantae slurmctld[29613]: fatal: It appears
you don't have any association data from your database. The
priority/multifactor plugin requires this information to run
correctly. Please check your database connection and try
again.<br class="">
Nov 29 12:56:36 plantae systemd[1]: slurmctld.service: Main
process exited, code=exited, status=1/FAILURE<br class="">
Nov 29 12:56:36 plantae systemd[1]: slurmctld.service: Unit
entered failed state.<br class="">
Nov 29 12:56:36 plantae systemd[1]: slurmctld.service:
Failed with result 'exit-code'.</blockquote>
<div class=""><br class="">
</div>
<div class=""> </div>
</div>
</div>
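      <div class="">One way to check for the situation described above is to scan slurm.conf for AccountingStoragePass and see whether its value looks like a socket path. The following is a minimal sketch, not a Slurm tool: the function names and the sample text are hypothetical, the value "magic" is taken from the "Failed to access" error in the log, and the parser assumes one key=value pair per line with no "#" inside values.</div>
<pre class="">
```python
import os


def accounting_storage_pass(conf_text):
    """Return the AccountingStoragePass value found in slurm.conf text, or None."""
    value = None
    for raw in conf_text.splitlines():
        # Assumption: "#" starts a comment and never appears inside a value.
        line = raw.split("#", 1)[0].strip()
        if "=" not in line:
            continue
        key, val = line.split("=", 1)
        # Slurm parameter names are case insensitive.
        if key.strip().lower() == "accountingstoragepass":
            value = val.strip()
    return value


def check(conf_text):
    """Classify the setting as ok, check, or suspect (hypothetical labels)."""
    value = accounting_storage_pass(conf_text)
    if value is None:
        return "ok: AccountingStoragePass not set, default MUNGE socket is used"
    if os.path.isabs(value):
        return "check: AccountingStoragePass points at socket path " + value
    return "suspect: AccountingStoragePass is not a socket path: " + value


# Hypothetical slurm.conf excerpt modelled on the log above.
SAMPLE = """
AccountingStorageType=accounting_storage/slurmdbd
AccountingStoragePass=magic
"""

if __name__ == "__main__":
    print(check(SAMPLE))
```
</pre>
      <div class="">A "suspect" result suggests removing the line from slurm.conf, or replacing it with the path of the intended MUNGE socket.</div>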
</blockquote>
<br class="">
</div>
</div></blockquote></div><br class=""></div></body></html>