<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>That's what we do here. We build three different RPMs:</p>
<p>server: because we run the latest MariaDB on our master</p>
<p>general compute</p>
<p>gpu compute: because we build against nvml</p>
<p>All three packages share the same name, but each lives in its own
repo, and each node gets the repo that matches its type.</p>
<p>We also keep our slurm.spec file in a git repo, with a branch for
each version and type, so we can stay organized.</p>
<p>-Paul Edmon-<br>
</p>
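<p>The layout described above (one package name, one repo per node
type, one spec branch per version and type) could be sketched roughly
as follows. This is only an illustration: the node-type labels, repo
IDs, branch naming scheme, and the repo.example.com URL are all
hypothetical and do not come from the message.</p>

```python
# Hypothetical sketch: map each node type to its own yum repo and to the
# git branch holding its slurm.spec. All names below are assumptions made
# for illustration, not the actual layout used by the author.

NODE_TYPES = ("server", "compute", "gpu")


def check_node_type(node_type):
    """Reject node types that have no dedicated RPM build."""
    if node_type not in NODE_TYPES:
        raise ValueError(f"unknown node type: {node_type!r}")


def repo_id(node_type):
    """Return the (assumed) repo ID for a node type, e.g. 'slurm-gpu'."""
    check_node_type(node_type)
    return f"slurm-{node_type}"


def spec_branch(version, node_type):
    """Return the (assumed) branch name for one Slurm version and node type."""
    check_node_type(node_type)
    return f"{version}-{node_type}"


def repo_file(node_type, base_url="http://repo.example.com"):
    """Render a minimal yum .repo stanza pointing at that type's repo."""
    rid = repo_id(node_type)
    lines = [
        f"[{rid}]",
        f"name=Slurm RPMs for {node_type} nodes",
        f"baseurl={base_url}/{rid}/",
        "enabled=1",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Print the stanza a GPU node would receive.
    print(repo_file("gpu"))
```

<p>Because the packages share a name, each node only needs the one
repo file for its type; a configuration management tool could drop the
rendered stanza into /etc/yum.repos.d/ on each node.</p>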
<div class="moz-cite-prefix">On 9/24/2020 3:31 PM, Dana, Jason T.
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:F3B12A7B-D05D-4C22-A535-DC09EBEEC230@jhuapl.edu">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt">Hello,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">I have what is
hopefully a quick question.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">I have
compiled Slurm RPMs on a CentOS system with the NVIDIA drivers
installed so that I can use AutoDetect=nvml in our GPU
nodes’ gres.conf. All has gone well on the GPU nodes since
then. However, I was unable to install the slurm RPM on the
control/master node because the RPM requires
libnvidia-ml.so. The control/master and other compute nodes
have no NVIDIA cards attached, so installing the drivers
just to satisfy this dependency did not seem like the best
idea. To get around this, I rebuilt the RPM without the
drivers present, and everything has been working well as far
as I can tell.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">I am now
working on adding PMIx support, which I didn’t properly add
initially, and I am running into the same situation. I
figured I would raise a flag and ask whether I am going
about this the wrong way. Is it typical to have to build
separate Slurm RPMs for different types of nodes, or is
there a better approach?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Thanks in
advance! <o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Jason<o:p></o:p></span></p>
</div>
</blockquote>
</body>
</html>