Dear slurm users,
It is my first time setting slurm up and I am looking for a solution to
this errors. Has anyone here already ecountered this problem. I would
really appreciate the help. mariadb, slurmdbd and slurmd are active.
*×* slurmctld.service - Slurm controller daemon
Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; *enabled*;
preset: *enabled*)
Active: *failed* (Result: exit-code) since Tue 2024-06-25 10:06:39
UTC; 2min 42s ago
Duration: 584ms
Docs: man:slurmctld(8)
Process: 63738 ExecStart=/usr/sbin/slurmctld --systemd
$SLURMCTLD_OPTIONS *(code=exited, status=1/FAILURE)*
Main PID: 63738 (code=exited, status=1/FAILURE)
CPU: 25ms
Jun 25 10:06:39 server systemd[1]: Starting slurmctld.service - Slurm
controller daemon...
Jun 25 10:06:39 server (lurmctld)[63738]: *slurmctld.service: Referenced
but unset environment variable evaluates to an empty string:
SLURMCTLD_OPTIONS*
Jun 25 10:06:39 server slurmctld[63738]: slurmctld: slurmctld version
23.11.4 started on servercluster
Jun 25 10:06:39 server systemd[1]: Started slurmctld.service - Slurm
controller daemon.
Jun 25 10:06:39 server slurmctld[63738]: slurmctld:
accounting_storage/slurmdbd: clusteracct_storage_p_register_ctld:
Registering slurmctld at port 6817 with slurmdbd
Jun 25 10:06:39 server slurmctld[63738]: slurmctld: priority/multifactor:
_read_last_decay_ran: No last decay
(/var/spool/slurm/state/priority_last_decay_ran)
to recover
Jun 25 10:06:39 server slurmctld[63738]: slurmctld: No memory enforcing
mechanism configured.
Jun 25 10:06:39 server slurmctld[63738]: slurmctld: fatal: Can not recover
last_conf_lite, incompatible version, (9472 not between 9728 and 10240),
start with '-i' to ignore this. Warning: using -i will lose the data that
can't be recovered.
Jun 25 10:06:39 server systemd[1]: *slurmctld.service: Main process exited,
code=exited, status=1/FAILURE*
Jun 25 10:06:39 server systemd[1]: *slurmctld.service: Failed with result
'exit-code'.*