[slurm-users] Trying to troubleshoot slurmctld start failure

Sopena Ballesteros Manuel manuel.sopena at cscs.ch
Wed Oct 12 19:42:28 UTC 2022


Dear Slurm user community,


I am new to slurm and trying to start a slurmd and slurmctld on same machine. I started with slurmctld which is having issues.


$ slurmctld -D -f /etc/slurm/slurm.conf -vvv
slurmctld: debug:  slurmctld log levels: stderr=debug2 logfile=debug2 syslog=quiet
slurmctld: debug:  Log file re-opened
slurmctld: pidfile not locked, assuming no running daemon
slurmctld: debug:  slurmscriptd: Got ack from slurmctld, initialization successful
slurmctld: debug:  slurmctld: slurmscriptd fork()'d and initialized.
slurmctld: debug:  _slurmscriptd_mainloop: started
slurmctld: debug:  _slurmctld_listener_thread: started listening to slurmscriptd
slurmctld: slurmctld version 22.05.4 started on cluster cluster-nomad
slurmctld: cred/munge: init: Munge credential signature plugin loaded
slurmctld: debug:  auth/munge: init: Munge authentication plugin loaded
slurmctld: select/cons_res: common_init: select/cons_res loaded
slurmctld: select/cons_tres: common_init: select/cons_tres loaded
slurmctld: select/cray_aries: init: Cray/Aries node selection plugin loaded
slurmctld: preempt/none: init: preempt/none loaded
slurmctld: debug:  acct_gather_energy/none: init: AcctGatherEnergy NONE plugin loaded
slurmctld: debug:  acct_gather_profile/none: init: AcctGatherProfile NONE plugin loaded
slurmctld: debug:  acct_gather_interconnect/none: init: AcctGatherInterconnect NONE plugin loaded
slurmctld: debug:  acct_gather_filesystem/none: init: AcctGatherFilesystem NONE plugin loaded
slurmctld: debug2: No acct_gather.conf file (/etc/slurm/acct_gather.conf)
slurmctld: debug:  jobacct_gather/none: init: Job accounting gather NOT_INVOKED plugin loaded
slurmctld: ext_sensors/none: init: ExtSensors NONE plugin loaded
slurmctld: debug:  MPI: Loading all types
slurmctld: error:  mpi/pmix_v3: init: (null) [0]: mpi_pmix.c:195: pmi/pmix: can not load PMIx library
slurmctld: error: Couldn't load specified plugin name for mpi/pmix_v3: Plugin init() callback failed
slurmctld: error: MPI: Cannot create context for mpi/pmix_v3
slurmctld: debug2: No mpi.conf file (/etc/slurm/mpi.conf)
slurmctld: accounting_storage/none: init: Accounting storage NOT INVOKED plugin loaded
slurmctld: debug:  create_mmap_buf: Failed to mmap file `/var/spool/slurmctld/assoc_usage`, No such device
slurmctld: debug2: No Assoc usage file (/var/spool/slurmctld/assoc_usage) to recover
slurmctld: debug:  switch Cray/Aries plugin loaded.
slurmctld: debug:  switch/none: init: switch NONE plugin loaded
slurmctld: debug:  Reading slurm.conf file: /etc/slurm/slurm.conf
slurmctld: debug:  NodeNames=x1004c1s5b0n0 setting Sockets=10 based on CPUs(10)/(CoresPerSocket(1)/ThreadsPerCore(1))
slurmctld: No memory enforcing mechanism configured.
slurmctld: topology/none: init: topology NONE plugin loaded
slurmctld: debug:  No DownNodes
slurmctld: debug:  slurmctld log levels: stderr=debug2 logfile=debug2 syslog=quiet
slurmctld: debug:  Log file re-opened
slurmctld: sched: Backfill scheduler plugin loaded
slurmctld: route/default: init: route default plugin loaded
slurmctld: debug:  _slurmscriptd_mainloop: finished
Segmentation fault

Could someone please help me understand what the issue is?


thank you
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20221012/754fa520/attachment.htm>


More information about the slurm-users mailing list