<div dir="ltr">I used a similar ./configure command:<div><br></div><div>./configure --prefix=/home/dean/src/slurm.versions/slurm-19.05.4.build --sysconfdir=/etc/slurm --enable-pam --with-pam_dir=/lib/x86_64-linux-gnu/security/ --without-shared-libslurm --enable-debug<br></div><div><br></div><div>The problem is that the path in --prefix is getting into the slurmd binary. The slurmd binary expects to find the stepd binary there instead of looking in /usr/local/sbin where the binaries get installed.</div><div><br></div><div>I've seen this happen when the slurmd.service file has that path in it, but that isn't the case.</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Feb 18, 2020 at 3:53 PM Alex Chekholko <<a href="mailto:alex@calicolabs.com">alex@calicolabs.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div>Hey Dean,</div><div><br></div>Here is what I found in my build notes which are now outdated by 1 year at least, but probably there are some more configure parameters you want to specify with relevant directories:<div><br></div><div>./configure --prefix=/tmp/slurm-build --sysconfdir=/etc/slurm --enable-pam --with-pam_dir=/lib/x86_64-linux-gnu/security/ --without-shared-libslurm<br></div><div><br></div><div>Regards,</div><div>Alex</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Feb 18, 2020 at 2:45 PM Dean Schulze <<a href="mailto:dean.w.schulze@gmail.com" target="_blank">dean.w.schulze@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">I built slurm on one machine (controller) and copied the new slurmd binary to a node. 
When I started it with systemctl, it failed with the message:<div><br></div><div>fatal: Unable to find slurmstepd file at /home/dean/src/slurm.versions/slurm-19.05.4.build/<br></div><div><br></div><div>The path it refers to is the one I passed to ./configure --prefix=... on the controller where I built the binaries. The make install step uses the --prefix= value as the directory where it installs the slurm* binaries. That path also gets written into the ExecStart= line of the generated .service files. I changed ExecStart= in the .service files to point to /usr/local/sbin, where I placed the slurm* binaries.</div><div><br></div><div>Here's my slurmd.service file on my node:</div><div><br></div><div>[Unit]<br>Description=Slurm node daemon<br>After=munge.service network.target remote-fs.target<br>ConditionPathExists=/etc/slurm/slurm.conf<br><br>[Service]<br>Type=forking<br>EnvironmentFile=-/etc/sysconfig/slurmd<br>ExecStart=/usr/local/sbin/slurmd $SLURMD_OPTIONS<br>ExecReload=/bin/kill -HUP $MAINPID<br>PIDFile=/var/run/slurmd.pid<br>KillMode=process<br>LimitNOFILE=131072<br>LimitMEMLOCK=infinity<br>LimitSTACK=infinity<br>Delegate=yes<br>TasksMax=infinity<br><br>[Install]<br>WantedBy=multi-user.target<br></div><div><br></div><div>Why is the slurmd binary looking for the build path? That path is not in any .service or .conf file on the node.<br></div></div>
</blockquote></div>
</blockquote></div>
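<div><br></div><div>One way to check whether the build path really is compiled into the slurmd binary is to search the binary for that string. This is only a minimal sketch: binary_embeds_path is a hypothetical helper name (not part of Slurm), the two paths are the ones from this thread, and it assumes a grep that can search binary files, as GNU grep does.</div>

```shell
#!/bin/sh
# binary_embeds_path is a hypothetical helper, not part of Slurm.
# It succeeds (exit status 0) if the file named by $1 contains the literal string $2.
binary_embeds_path() {
    # -q: no output, only an exit status; -F: treat the pattern as a fixed string
    grep -qF -- "$2" "$1"
}

# Paths taken from this thread; adjust them for your own node.
slurmd_bin=/usr/local/sbin/slurmd
build_prefix=/home/dean/src/slurm.versions/slurm-19.05.4.build

# Only run the check if the binary is actually present on this machine.
if [ -f "$slurmd_bin" ]; then
    if binary_embeds_path "$slurmd_bin" "$build_prefix"; then
        echo "build prefix is embedded in $slurmd_bin"
    else
        echo "build prefix not found in $slurmd_bin"
    fi
fi
```

<div>If it reports that the prefix is embedded, that would suggest the path was fixed at build time and does not come from any .service or .conf file on the node.</div>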