<div dir="ltr">Dear Jeffrey,<div><br></div><div>Thank you for your response. I have followed the steps as instructed. After the copying the files to their respective locations "systemctl status slurmctld.service" command gives me an error as follows: </div><div><br></div>(base) [nousheen@exxact system]$ systemctl daemon-reload<br>(base) [nousheen@exxact system]$ systemctl enable slurmctld.service<br>(base) [nousheen@exxact system]$ systemctl start slurmctld.service<br>(base) [nousheen@exxact system]$ systemctl status slurmctld.service<br>● slurmctld.service - Slurm controller daemon<br> Loaded: loaded (/etc/systemd/system/slurmctld.service; enabled; vendor preset: disabled)<br> Active: failed (Result: exit-code) since Mon 2022-01-31 10:04:31 PKT; 3s ago<br> Process: 18114 ExecStart=/usr/local/sbin/slurmctld -D -s $SLURMCTLD_OPTIONS (code=exited, status=1/FAILURE)<br> Main PID: 18114 (code=exited, status=1/FAILURE)<br><br>Jan 31 10:04:31 exxact systemd[1]: Started Slurm controller daemon.<br>Jan 31 10:04:31 exxact systemd[1]: slurmctld.service: main process exited, code=exited, status=1/FAILURE<br>Jan 31 10:04:31 exxact systemd[1]: Unit slurmctld.service entered failed state.<br>Jan 31 10:04:31 exxact systemd[1]: slurmctld.service failed.<br><div> </div><div><br></div><div>Kindly guide me. Thank you so much for your time. </div><div><br clear="all"><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr">Best Regards,</div><div dir="ltr"><span style="font-family:arial;font-size:small">Nousheen Parvaiz</span><br style="font-family:arial;font-size:small"><div style="font-family:arial;font-size:small"> </div></div></div></div></div></div></div></div></div></div><br></div></div><div hspace="streak-pt-mark" style="max-height:1px"><img alt="" style="width:0px;max-height:0px;overflow:hidden" src="https://mailfoogae.appspot.com/t?sender=abm91c2hlZW5wYXJ2YWl6QGdtYWlsLmNvbQ%3D%3D&type=zerocontent&guid=7f4e3a9b-d0f7-4027-b3b1-c8540874f51c"><font color="#ffffff" size="1">ᐧ</font></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Jan 27, 2022 at 8:25 PM Jeffrey R. Lang <<a href="mailto:JRLang@uwyo.edu">JRLang@uwyo.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="gmail-m_-7165199381630829102WordSection1">
<p class="MsoNormal">The missing file error has nothing to do with slurm. The systemctl command is part of the systems service management.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">The error message indicates that you haven’t copied the slurmd.service file on your compute node to /etc/systemd/system or /usr/lib/systemd/system. /etc/systemd/system is usually used when a user adds a new service to a machine.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Depending on your version of Linux you may also need to do a systemctl daemon-reload to activate the slurmd.service within system.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Once slurmd.service is copied over, the systemctld command should work just fine.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Remember:<u></u><u></u></p>
<p class="MsoNormal"> slurmd.service - Only on compute nodes<u></u><u></u></p>
<p class="MsoNormal"> slurmctld.service – Only on your cluster management node<u></u><u></u></p>
<p class="MsoNormal"> slurmdbd.service – Only on your cluster management node<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<div style="border-right:none;border-bottom:none;border-left:none;border-top:1pt solid rgb(225,225,225);padding:3pt 0in 0in">
<p class="MsoNormal"><b>From:</b> slurm-users <<a href="mailto:slurm-users-bounces@lists.schedmd.com" target="_blank">slurm-users-bounces@lists.schedmd.com</a>>
<b>On Behalf Of </b>Nousheen<br>
<b>Sent:</b> Thursday, January 27, 2022 3:54 AM<br>
<b>To:</b> Slurm User Community List <<a href="mailto:slurm-users@lists.schedmd.com" target="_blank">slurm-users@lists.schedmd.com</a>><br>
<b>Subject:</b> [slurm-users] systemctl enable slurmd.service Failed to execute operation: No such file or directory<u></u><u></u></p>
</div>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<div style="border:2.25pt solid red;padding:1pt 4pt">
<p class="MsoNormal" style="line-height:11.35pt">
<span style="font-family:"Cambria Math",serif">◆</span> This message was sent from a non-UWYO address. Please exercise caution when clicking links or opening attachments from external sources.<u></u><u></u></p>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Hello everyone,<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I am installing slurm on Centos 7 following tutorial: <a href="https://www.slothparadise.com/how-to-install-slurm-on-centos-7-cluster/" target="_blank">https://www.slothparadise.com/how-to-install-slurm-on-centos-7-cluster/</a><u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I am at the step where we start slurm but it gives me the following error:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<p class="MsoNormal">[root@exxact slurm-21.08.5]# systemctl enable slurmd.service<u></u><u></u></p>
<div>
<p class="MsoNormal">Failed to execute operation: No such file or directory<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I have run the command to check if slurm is configured properly<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<p class="MsoNormal">[root@exxact slurm-21.08.5]# slurmd -C<br>
NodeName=exxact CPUs=12 Boards=1 SocketsPerBoard=1 CoresPerSocket=6 ThreadsPerCore=2 RealMemory=31889<br>
UpTime=19-16:06:00<u></u><u></u></p>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I am new to this and unable to understand the problem. Kindly help me resolve this.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">My slurm.conf file is as follows:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<p class="MsoNormal"># slurm.conf file generated by configurator.html.<br>
# Put this file on all nodes of your cluster.<br>
# See the slurm.conf man page for more information.<br>
#<br>
ClusterName=cluster194<br>
SlurmctldHost=192.168.60.194<br>
#SlurmctldHost=<br>
#<br>
#DisableRootJobs=NO<br>
#EnforcePartLimits=NO<br>
#Epilog=<br>
#EpilogSlurmctld=<br>
#FirstJobId=1<br>
#MaxJobId=67043328<br>
#GresTypes=<br>
#GroupUpdateForce=0<br>
#GroupUpdateTime=600<br>
#JobFileAppend=0<br>
#JobRequeue=1<br>
#JobSubmitPlugins=lua<br>
#KillOnBadExit=0<br>
#LaunchType=launch/slurm<br>
#Licenses=foo*4,bar<br>
#MailProg=/bin/mail<br>
#MaxJobCount=10000<br>
#MaxStepCount=40000<br>
#MaxTasksPerNode=512<br>
MpiDefault=none<br>
#MpiParams=ports=#-#<br>
#PluginDir=<br>
#PlugStackConfig=<br>
#PrivateData=jobs<br>
ProctrackType=proctrack/cgroup<br>
#Prolog=<br>
#PrologFlags=<br>
#PrologSlurmctld=<br>
#PropagatePrioProcess=0<br>
#PropagateResourceLimits=<br>
#PropagateResourceLimitsExcept=<br>
#RebootProgram=<br>
ReturnToService=1<br>
SlurmctldPidFile=/var/run/slurmctld.pid<br>
SlurmctldPort=6817<br>
SlurmdPidFile=/var/run/slurmd.pid<br>
SlurmdPort=6818<br>
SlurmdSpoolDir=/var/spool/slurmd<br>
SlurmUser=nousheen<br>
#SlurmdUser=root<br>
#SrunEpilog=<br>
#SrunProlog=<br>
StateSaveLocation=/home/nousheen/Documents/SILICS/slurm-21.08.5/slurmctld<br>
SwitchType=switch/none<br>
#TaskEpilog=<br>
TaskPlugin=task/affinity<br>
#TaskProlog=<br>
#TopologyPlugin=topology/tree<br>
#TmpFS=/tmp<br>
#TrackWCKey=no<br>
#TreeWidth=<br>
#UnkillableStepProgram=<br>
#UsePAM=0<br>
#<br>
#<br>
# TIMERS<br>
#BatchStartTimeout=10<br>
#CompleteWait=0<br>
#EpilogMsgTime=2000<br>
#GetEnvTimeout=2<br>
#HealthCheckInterval=0<br>
#HealthCheckProgram=<br>
InactiveLimit=0<br>
KillWait=30<br>
#MessageTimeout=10<br>
#ResvOverRun=0<br>
MinJobAge=300<br>
#OverTimeLimit=0<br>
SlurmctldTimeout=120<br>
SlurmdTimeout=300<br>
#UnkillableStepTimeout=60<br>
#VSizeFactor=0<br>
Waittime=0<br>
#<br>
#<br>
# SCHEDULING<br>
#DefMemPerCPU=0<br>
#MaxMemPerCPU=0<br>
#SchedulerTimeSlice=30<br>
SchedulerType=sched/backfill<br>
SelectType=select/cons_tres<br>
SelectTypeParameters=CR_Core<br>
#<br>
#<br>
# JOB PRIORITY<br>
#PriorityFlags=<br>
#PriorityType=priority/basic<br>
#PriorityDecayHalfLife=<br>
#PriorityCalcPeriod=<br>
#PriorityFavorSmall=<br>
#PriorityMaxAge=<br>
#PriorityUsageResetPeriod=<br>
#PriorityWeightAge=<br>
#PriorityWeightFairshare=<br>
#PriorityWeightJobSize=<br>
#PriorityWeightPartition=<br>
#PriorityWeightQOS=<br>
#<br>
#<br>
# LOGGING AND ACCOUNTING<br>
#AccountingStorageEnforce=0<br>
#AccountingStorageHost=<br>
#AccountingStoragePass=<br>
#AccountingStoragePort=<br>
AccountingStorageType=accounting_storage/none<br>
#AccountingStorageUser=<br>
#AccountingStoreFlags=<br>
#JobCompHost=<br>
#JobCompLoc=<br>
#JobCompPass=<br>
#JobCompPort=<br>
JobCompType=jobcomp/none<br>
#JobCompUser=<br>
#JobContainerType=job_container/none<br>
JobAcctGatherFrequency=30<br>
JobAcctGatherType=jobacct_gather/none<br>
SlurmctldDebug=info<br>
SlurmctldLogFile=/var/log/slurmctld.log<br>
SlurmdDebug=info<br>
SlurmdLogFile=/var/log/slurmd.log<br>
#SlurmSchedLogFile=<br>
#SlurmSchedLogLevel=<br>
#DebugFlags=<br>
#<br>
#<br>
# POWER SAVE SUPPORT FOR IDLE NODES (optional)<br>
#SuspendProgram=<br>
#ResumeProgram=<br>
#SuspendTimeout=<br>
#ResumeTimeout=<br>
#ResumeRate=<br>
#SuspendExcNodes=<br>
#SuspendExcParts=<br>
#SuspendRate=<br>
#SuspendTime=<br>
#<br>
#<br>
# COMPUTE NODES<br>
NodeName=linux[1-32] CPUs=11 State=UNKNOWN<u></u><u></u></p>
<div>
<p class="MsoNormal">PartitionName=debug Nodes=ALL Default=YES MaxTime=INFINITE State=UP <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<p class="MsoNormal"><br clear="all">
<u></u><u></u></p>
<div>
<div>
<div>
<div>
<div>
<div>
<p class="MsoNormal">Best Regards,<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-size:12pt;font-family:Arial,sans-serif">Nousheen Parvaiz</span><u></u><u></u></p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<div>
<p class="MsoNormal"><img border="0" width="1" height="1" style="width: 0.0104in; height: 0.0104in;" id="gmail-m_-7165199381630829102_x0000_i1025" src="https://mailfoogae.appspot.com/t?sender=abm91c2hlZW5wYXJ2YWl6QGdtYWlsLmNvbQ%3D%3D&type=zerocontent&guid=86d278c2-27fa-40b4-bdda-6bf66542615b"><span style="font-size:7.5pt;font-family:Gadugi,sans-serif;color:white">ᐧ</span><u></u><u></u></p>
</div>
</div>
</div>
</div>
</blockquote></div>