<div dir="ltr"><div>Hey Graziano,</div><div><br></div>To make your decision more "data-driven", you can feed your Slurm accounting logs into a tool like Open XDMoD, which will generate charts of usage by user, group, job, GRES, etc.<div><br></div><div><a href="https://open.xdmod.org/8.0/index.html">https://open.xdmod.org/8.0/index.html</a><br></div><div><br></div><div>You may also consider assigning this task to one of your "machine learning" researchers and asking them to "predict" the resources needed. :)</div><div><br></div><div>Regards,</div><div>Alex</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Mar 21, 2019 at 8:48 AM Graziano D'Innocenzo &lt;<a href="mailto:graziano.dinnocenzo@adaptcentre.ie">graziano.dinnocenzo@adaptcentre.ie</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Dear Slurm users,<br>
<br>
my team is managing an HPC cluster (running Slurm) for a research<br>
centre. We are planning to expand the cluster in the next couple of<br>
years and we are facing a problem. We would like to put a figure on<br>
how many resources will be needed on average for each user (in terms<br>
of CPU cores, RAM, GPUs) but we have almost one hundred researchers<br>
using the cluster for all sorts of different use cases so there isn't<br>
a typical workload that we could take as a model. Most of the work is,<br>
however, in the field of machine learning and deep learning. Users<br>
range from first-year PhD students with limited skills to<br>
researchers and professors with many years of experience.<br>
In principle we could use a mix of: looking at current usage patterns,<br>
user surveys, etc.<br>
<br>
I was just wondering whether anyone here, working in a similar<br>
setting, had some sort of guidelines that they have been using for<br>
budgeting hardware purchases and that they would be willing to share?<br>
<br>
Many thanks and regards<br>
<br>
<br>
<br>
--<br>
Graziano D'Innocenzo (PGP key: 9213BE46)<br>
Systems Administrator - ADAPT Centre<br>
<br>
</blockquote></div>