[slurm-users] SlurmDBD setup with mysql
Matthew BETTINGER
matthew.bettinger at external.total.com
Thu Jan 17 17:21:46 UTC 2019
Not sure if this is related, but we ran into an issue configuring accounting because our cluster name had a '-' in it. This is an illegal character for table names in MariaDB, or used to be.
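A quick way to check a cluster name before pointing accounting at it is sketched below. This is only an illustration: as far as I know the MySQL accounting plugin prefixes its per-cluster tables with the cluster name (for example `ares_job_table`), and the "safe" character set used here (letters, digits, underscore) is a conservative assumption, not MariaDB's exact identifier rules. The helper name `unsafe_chars` is made up for this example.

```python
import re


def unsafe_chars(cluster_name):
    """Return the characters in cluster_name outside [A-Za-z0-9_].

    Assumption: anything outside this conservative set may cause trouble
    when the name is used as a table-name prefix in MySQL/MariaDB.
    """
    return sorted(set(re.findall(r"[^A-Za-z0-9_]", cluster_name)))


if __name__ == "__main__":
    # "ares" is the ClusterName from the quoted slurm.conf;
    # "my-cluster" is a hypothetical name containing a hyphen.
    for name in ("ares", "my-cluster"):
        bad = unsafe_chars(name)
        if bad:
            print(f"{name}: possibly problematic characters {bad}")
        else:
            print(f"{name}: looks fine")
```

The ClusterName in the quoted config (`ares`) passes this check, so the hyphen issue probably does not apply here.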
On 1/17/19, 11:07 AM, "slurm-users on behalf of Sajesh Singh" <slurm-users-bounces at lists.schedmd.com on behalf of ssingh at amnh.org> wrote:
Trying to set up accounting using the MySQL backend, and I am getting errors from slurmctld and the Slurm tools when trying to interact with the accounting database. I tried starting in debug mode as well, but could not see anything else that could point to what is causing this issue. I have followed all the steps listed at:
https://slurm.schedmd.com/accounting.html
The DBD daemon starts up without error as can be seen from the logs:
slurmdbd.log:
[2019-01-17T10:18:40.365] chdir to /var/tmp
[2019-01-17T10:18:40.367] slurmdbd version 18.08.4 started
Though in the slurmctld.log file I am seeing the following:
[2019-01-17T10:19:40.024] error: Persistent Conn: read: No error
[2019-01-17T10:19:40.024] error: Persistent Conn: only read 117 of 2613 bytes
[2019-01-17T10:19:40.025] error: Persistent Conn: read: No error
[2019-01-17T10:19:40.025] error: Persistent Conn: only read 117 of 2613 bytes
[2019-01-17T10:19:40.025] error: Persistent Conn: read: No error
[2019-01-17T10:19:40.025] error: Persistent Conn: only read 119 of 1375731712 bytes
[2019-01-17T10:19:40.025] error: Persistent Conn: read: No error
[2019-01-17T10:19:40.025] error: Persistent Conn: only read 117 of 2613 bytes
[2019-01-17T10:19:40.025] error: slurm_persist_conn_open: Failed to unpack persistent connection init resp message from 127.0.0.1:3306
[2019-01-17T10:19:40.025] error: slurmdbd: Sending PersistInit msg: No error
I restart all services in the order munge, slurmdbd, slurmctld after any config change.
The relevant configs are as follows:
slurmdbd.conf:
AuthType=auth/munge
DbdAddr=127.0.0.1
DbdHost=localhost
DbdPort=7031
SlurmUser=slurm
LogFile=/var/log/slurmdbd.log
PidFile=/var/run/slurmdbd.pid
StorageType=accounting_storage/mysql
StorageHost=localhost
StoragePort=3306
StoragePass=xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
StorageUser=slurm
StorageLoc=slurm_acct_db
slurm.conf:
AccountingStorageType=accounting_storage/slurmdbd
AccountingStorageHost=127.0.0.1
AccountingStoragePort=3306
AccountingStorageUser=slurm
AccountingStorageEnforce=limits
AccountingStoreJobComment=YES
AcctGatherNodeFreq=30
ClusterName=ares
JobAcctGatherFrequency=30
JobAcctGatherType=jobacct_gather/linux
We are running Slurm 18.08.4 (the version reported in slurmdbd.log above) on CentOS 7 with MariaDB as the backend.
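One thing that may be worth checking in the configs above: with AccountingStorageType=accounting_storage/slurmdbd, AccountingStoragePort in slurm.conf is the port slurmdbd listens on (DbdPort), not the database port. Here it is 3306, which matches StoragePort rather than DbdPort=7031, and the slurmctld log shows the failed handshake coming from 127.0.0.1:3306, so slurmctld may be talking to MariaDB directly. The sketch below cross-checks the two files. The function names are made up for this example, the parser only handles simple Key=Value lines, and 6819 is assumed as the default slurmdbd port when none is set.

```python
def parse_conf(text):
    """Parse simple Key=Value lines, ignoring blank lines and # comments."""
    settings = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if "=" not in line:
            continue
        key, value = line.split("=", 1)
        # Compare keys case-insensitively.
        settings[key.strip().lower()] = value.strip()
    return settings


def check_ports(slurm_conf, slurmdbd_conf):
    """Return warnings if slurm.conf does not point at the slurmdbd port."""
    slurm = parse_conf(slurm_conf)
    dbd = parse_conf(slurmdbd_conf)
    warnings = []
    if slurm.get("accountingstoragetype") != "accounting_storage/slurmdbd":
        return warnings  # check only applies when going through slurmdbd
    # Assumption: 6819 is the default slurmdbd port when unset.
    acct_port = slurm.get("accountingstorageport", "6819")
    dbd_port = dbd.get("dbdport", "6819")
    if acct_port != dbd_port:
        warnings.append(
            f"AccountingStoragePort={acct_port} does not match DbdPort={dbd_port}"
        )
    if acct_port == dbd.get("storageport"):
        warnings.append(
            f"AccountingStoragePort={acct_port} equals StoragePort "
            "(the database port); slurmctld would bypass slurmdbd"
        )
    return warnings


# Values copied from the configs quoted in this message.
SLURM_CONF = """
AccountingStorageType=accounting_storage/slurmdbd
AccountingStorageHost=127.0.0.1
AccountingStoragePort=3306
"""
SLURMDBD_CONF = """
DbdPort=7031
StoragePort=3306
"""

if __name__ == "__main__":
    for warning in check_ports(SLURM_CONF, SLURMDBD_CONF):
        print("WARNING:", warning)
```

If that is indeed the problem, setting AccountingStoragePort=7031 in slurm.conf and restarting slurmctld would be the change to try.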
-SS-