[slurm-users] How to fix “slurmd.service: Can't open PID file” error

mercan ahmet.mercan at uhem.itu.edu.tr
Tue Jun 18 12:27:51 UTC 2019


Hi;

I did not notice

SlurmUser=noki

line. The owner of the /var/run/slurm-llnl directory and the 
slurmctld.pid and slurmd.pid files should be "noki" user.

chown -R noki:root /var/spool/slurm-llnl

Regards;

Ahmet M.


On 18.06.2019 15:15, mercan wrote:
> Hi;
>
> The owner of the /var/run/slurm-llnl directory and the slurmctld.pid 
> and slurmd.pid files should be "slurm" user. Your files owner are root 
> and noki.
>
> chown -R slurm:slurm /var/spool/slurm-llnl
>
>
> Regards;
>
> Ahmet M.
>
>
> On 18.06.2019 15:03, Noki Lee wrote:
>>
>> Though SLURM works fine for job submitting, running, and queueing, I 
>> got a minor error below.
>>
>> |sudo systemctl status slurmd|
>>
>> |Jun 12 10:20:40 noki-System-Product-Name systemd[1]: slurmd.service: 
>> Can't open PID file /var/run/slurm-llnl/slurmd.pid (yet?) after 
>> start: No such file or directory|
>>
>> |sudo systemctl status slurmctld|
>>
>> |Jun 12 10:20:40 noki-System-Product-Name systemd[1]: slurmd.service: 
>> Can't open PID file /var/run/slurm-llnl/slurmd.pid (yet?) after 
>> start: No such file or directory|
>>
>> I followed the installation of a guide from
>>
>> ftp://www.microway.com/pub/pub/for-customer/SDSU-Training/Webinar_2_Slurm_II--Ubuntu16.04_and_18.04.pdf 
>>
>>
>> This problem may come from the ownership of slurm.conf file?
>>
>> Here are my slurm.conf and ownership for slur*.pid
>>
>> |# slurm.conf file generated by configurator easy.html. # Put this 
>> file on all nodes of your cluster. # See the slurm.conf man page for 
>> more information. # ControlMachine=noki-System-Product-Name 
>> #ControlAddr= # #MailProg=/bin/mail MpiDefault=none 
>> #MpiParams=ports=#-# ProctrackType=proctrack/pgid ReturnToService=1 
>> SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid 
>> #SlurmctldPort=6817 SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid 
>> #SlurmdPort=6818 SlurmdSpoolDir=/var/spool/slurmd SlurmUser=noki 
>> #SlurmdUser=root StateSaveLocation=/var/spool/slurm-llnl 
>> SwitchType=switch/none TaskPlugin=task/none # # # TIMERS #KillWait=30 
>> #MinJobAge=300 #SlurmctldTimeout=120 #SlurmdTimeout=300 # # # 
>> SCHEDULING FastSchedule=1 SchedulerType=sched/backfill 
>> SelectType=select/linear #SelectTypeParameters= # # # LOGGING AND 
>> ACCOUNTING AccountingStorageType=accounting_storage/none 
>> ClusterName=linux #JobAcctGatherFrequency=30 
>> JobAcctGatherType=jobacct_gather/none #SlurmctldDebug=3 
>> SlurmctldLogFile=/var/log/slurm-llnl/SlurmctldLogFile #SlurmdDebug=3 
>> SlurmdLogFile=/var/log/slurm-llnl/SlurmdLogFile # # # COMPUTE NODES 
>> NodeName=noki-System-Product-Name CPUs=4 RealMemory=6963 Sockets=1 
>> CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN PartitionName=debug 
>> Nodes=noki-System-Product-Name Default=YES MaxTime=INFINITE State=UP |
>> |$ ls -l /var/run/slurm-llnl/ total 8 -rw-r--r-- 1 noki root 6 Jun 12 
>> 10:20 slurmctld.pid -rw-r--r-- 1 root root 6 Jun 12 10:20 slurmd.pid|
>>
>



More information about the slurm-users mailing list