[slurm-users] How to fix “slurmd.service: Can't open PID file” error
mercan
ahmet.mercan at uhem.itu.edu.tr
Tue Jun 18 12:27:51 UTC 2019
Hi;
I did not notice the
SlurmUser=noki
line. The /var/run/slurm-llnl directory and the slurmctld.pid and
slurmd.pid files should be owned by the "noki" user.
chown -R noki:root /var/run/slurm-llnl
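
To see whether the ownership matches the configuration before changing anything, something like the following may help. It is only a minimal sketch: the helper names conf_value and check_owner are made up for illustration, it assumes GNU stat (as shipped on Ubuntu), and the slurm.conf location in the usage comments is an assumption that may differ on your system.

```shell
#!/bin/sh
# Hypothetical helper: print the value of a key from a slurm.conf-style file.
# Commented-out lines (starting with '#') are ignored; the last match wins.
# Simplification: the key is matched case-sensitively.
conf_value() {
    # $1 = key name, $2 = path to the config file
    sed -n "s/^[[:space:]]*$1=\([^[:space:]#]*\).*/\1/p" "$2" | tail -n 1
}

# Hypothetical helper: report whether a path is owned by the expected user.
# Returns non-zero if the path is missing or owned by someone else.
check_owner() {
    # $1 = expected user, $2 = path to check
    actual=$(stat -c %U "$2" 2>/dev/null) || { echo "MISSING $2"; return 1; }
    if [ "$actual" = "$1" ]; then
        echo "OK $2 ($actual)"
    else
        echo "MISMATCH $2 (owner $actual, expected $1)"
        return 1
    fi
}

# Example usage (the config path is an assumption, adjust as needed):
#   conf=/etc/slurm-llnl/slurm.conf
#   user=$(conf_value SlurmUser "$conf")
#   check_owner "$user" "$(conf_value SlurmctldPidFile "$conf")"
#   check_owner "$user" "$(conf_value StateSaveLocation "$conf")"
```

The sketch only reports mismatches; it does not change ownership, so the chown still has to be run by hand.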
Regards;
Ahmet M.
On 18.06.2019 15:15, mercan wrote:
> Hi;
>
> The /var/run/slurm-llnl directory and the slurmctld.pid and
> slurmd.pid files should be owned by the "slurm" user. Your files are
> owned by root and noki.
>
> chown -R slurm:slurm /var/run/slurm-llnl
>
>
> Regards;
>
> Ahmet M.
>
>
> On 18.06.2019 15:03, Noki Lee wrote:
>>
>> Although SLURM works fine for submitting, running, and queueing
>> jobs, I get the minor error below.
>>
>> sudo systemctl status slurmd
>>
>> Jun 12 10:20:40 noki-System-Product-Name systemd[1]: slurmd.service:
>> Can't open PID file /var/run/slurm-llnl/slurmd.pid (yet?) after
>> start: No such file or directory
>>
>> sudo systemctl status slurmctld
>>
>> Jun 12 10:20:40 noki-System-Product-Name systemd[1]: slurmd.service:
>> Can't open PID file /var/run/slurm-llnl/slurmd.pid (yet?) after
>> start: No such file or directory
>>
>> I followed the installation guide from
>>
>> ftp://www.microway.com/pub/pub/for-customer/SDSU-Training/Webinar_2_Slurm_II--Ubuntu16.04_and_18.04.pdf
>>
>>
>> Could this problem come from the ownership of the slurm.conf file?
>>
>> Here are my slurm.conf and the ownership of the slur*.pid files:
>>
>> # slurm.conf file generated by configurator easy.html.
>> # Put this file on all nodes of your cluster.
>> # See the slurm.conf man page for more information.
>> #
>> ControlMachine=noki-System-Product-Name
>> #ControlAddr=
>> #
>> #MailProg=/bin/mail
>> MpiDefault=none
>> #MpiParams=ports=#-#
>> ProctrackType=proctrack/pgid
>> ReturnToService=1
>> SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid
>> #SlurmctldPort=6817
>> SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid
>> #SlurmdPort=6818
>> SlurmdSpoolDir=/var/spool/slurmd
>> SlurmUser=noki
>> #SlurmdUser=root
>> StateSaveLocation=/var/spool/slurm-llnl
>> SwitchType=switch/none
>> TaskPlugin=task/none
>> #
>> #
>> # TIMERS
>> #KillWait=30
>> #MinJobAge=300
>> #SlurmctldTimeout=120
>> #SlurmdTimeout=300
>> #
>> #
>> # SCHEDULING
>> FastSchedule=1
>> SchedulerType=sched/backfill
>> SelectType=select/linear
>> #SelectTypeParameters=
>> #
>> #
>> # LOGGING AND ACCOUNTING
>> AccountingStorageType=accounting_storage/none
>> ClusterName=linux
>> #JobAcctGatherFrequency=30
>> JobAcctGatherType=jobacct_gather/none
>> #SlurmctldDebug=3
>> SlurmctldLogFile=/var/log/slurm-llnl/SlurmctldLogFile
>> #SlurmdDebug=3
>> SlurmdLogFile=/var/log/slurm-llnl/SlurmdLogFile
>> #
>> #
>> # COMPUTE NODES
>> NodeName=noki-System-Product-Name CPUs=4 RealMemory=6963 Sockets=1 CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN
>> PartitionName=debug Nodes=noki-System-Product-Name Default=YES MaxTime=INFINITE State=UP
>>
>> $ ls -l /var/run/slurm-llnl/
>> total 8
>> -rw-r--r-- 1 noki root 6 Jun 12 10:20 slurmctld.pid
>> -rw-r--r-- 1 root root 6 Jun 12 10:20 slurmd.pid
>>
>