[slurm-users] DBD_SEND_MULT_MSG - invalid uid error
Timony, Mick
Michael_Timony at hms.harvard.edu
Mon Jan 8 19:48:42 UTC 2024
This ticket with SchedMD implies it's a munged issue:
https://bugs.schedmd.com/show_bug.cgi?id=1293
Is the munge daemon running on all systems? If it is, are all servers running a network time daemon such chronyd or ntpd and the time is in sync on all hosts?
Regards
--Mick
________________________________
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Craig Stark <cestark at ad.uci.edu>
Sent: Monday, January 8, 2024 1:51 PM
To: slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
Subject: Re: [slurm-users] DBD_SEND_MULT_MSG - invalid uid error
3rd time trying to get this to come through to the list - hopefully this time works.
I've been running SLURM for several years now, but in setting it up on a new cluster, I'm hitting a recurring issue. I'm using a MariaDB and configured it just as I had in my several-year-ago setup and in the docs. There's a "slurm" user (59999) on the OS (Rocky 9), that's on all the nodes, and I've added the slurm at localhost as instructed (grant all on slurm_acct_db.* TO 'slurm'@'localhost' identified by 'PASSWORD'). But, I keep getting things like this:
```
Dec 22 14:22:07 kirby slurmdbd[14518]: slurmdbd: error: DBD_SEND_MULT_MSG message from invalid uid 59999
Dec 22 14:22:07 kirby slurmdbd[14518]: slurmdbd: error: Processing last message from connection 7(192.168.1.2) uid(59999)
Dec 22 14:22:07 kirby slurmdbd[14518]: slurmdbd: error: CONN:7 DBD_REGISTER_CTLD message from invalid uid 59999
Dec 22 14:22:07 kirby slurmdbd[14518]: slurmdbd: error: CONN:7 Security violation, DBD_REGISTER_CTLD
Dec 22 14:22:07 kirby slurmdbd[14518]: slurmdbd: error: Processing last message from connection 7(192.168.1.2) uid(59999)
```
I'm a total SQL noob, but can at least verify that the user is in there:
MariaDB [(none)]> SELECT User, Host, Password FROM mysql.user;
+-------------+-----------+-------------------------------------------+
| User | Host | Password |
+-------------+-----------+-------------------------------------------+
| mariadb.sys | localhost | |
| root | localhost | invalid |
| mysql | localhost | invalid |
| slurm | localhost | *D6665ECF4F3CB12BCA836117F7727B6D0B78D644 |
+-------------+-----------+-------------------------------------------+
4 rows in set (0.002 sec)
Any thoughts as to where I might look to fix this?
Craig
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20240108/d9f02f00/attachment-0001.htm>
More information about the slurm-users
mailing list