<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="Generator" content="Microsoft Exchange Server">
<!-- converted from text --><style><!-- .EmailQuote { margin-left: 1pt; padding-left: 4pt; border-left: #800000 2px solid; } --></style>
</head>
<body>
<meta content="text/html; charset=UTF-8">
<style type="text/css" style="">
<!--
p
{margin-top:0;
margin-bottom:0}
-->
</style>
<div dir="ltr">
<div id="x_divtagdefaultwrapper" dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<p></p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
Thank you for the responses.</p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
<br>
</p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
In response to some of the suggestions, I would like to provide further details on my specific use case. I am currently exploring malleable jobs, which can adapt their computing resources at runtime. </p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
To tackle the MPI incompatibility issue associated with malleable jobs, there are solutions such as Flex-MPI, which extends MPI to support resource adaptivity for malleable jobs at runtime. <span style="font-size:12pt">Furthermore, there
are scheduling algorithms tailored for malleable jobs. These algorithms aim to allocate resources efficiently and optimize job scheduling based on the dynamic nature of malleable jobs. </span></p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
My primary objective is to understand how Slurm can effectively support malleable jobs, so I am investigating how Slurm can expand and shrink the set of nodes allocated to a job while it is running.</p>
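<p>As a concrete reference point, here is a minimal sketch of the resize steps as I understand them from the Slurm FAQ entry on changing a job's size: shrinking with <code>scontrol update ... NumNodes=N</code>, and expanding by submitting a second allocation with <code>--dependency=expand:</code> and then merging it. The helper function names and the job IDs are hypothetical, the script only builds the command lines and does not run them, and whether expansion is permitted may depend on the Slurm version and site configuration.</p>

```python
import shlex
from typing import List


def shrink_cmd(job_id: int, num_nodes: int) -> List[str]:
    """Build the scontrol call that shrinks a running job to num_nodes nodes."""
    if 1 > num_nodes:
        raise ValueError("a running job must keep at least one node")
    return ["scontrol", "update", f"JobId={job_id}", f"NumNodes={num_nodes}"]


def expand_request_cmd(job_id: int, extra_nodes: int) -> List[str]:
    """Build the request for a helper allocation meant to expand job_id."""
    return ["salloc", f"--nodes={extra_nodes}", f"--dependency=expand:{job_id}"]


def merge_cmds(job_id: int, helper_job_id: int) -> List[List[str]]:
    """Build the two scontrol calls that move the helper's nodes into job_id."""
    return [
        # Run inside the helper job: give up its nodes without ending them.
        ["scontrol", "update", f"JobId={helper_job_id}", "NumNodes=0"],
        # Run inside the original job: pick up every node made available.
        ["scontrol", "update", f"JobId={job_id}", "NumNodes=ALL"],
    ]


if __name__ == "__main__":
    # 1234 and 1235 are made-up job IDs used only for illustration.
    commands = [shrink_cmd(1234, 2), expand_request_cmd(1234, 1)]
    commands += merge_cmds(1234, 1235)
    for cmd in commands:
        print(shlex.join(cmd))
```

<p>These commands only change the allocation that Slurm tracks. The application itself (for example through Flex-MPI) would still have to notice the change and start or stop processes accordingly.</p>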
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
<br>
</p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
Best Regards</p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
<br>
</p>
<p style="font-family:Calibri,Helvetica,sans-serif,EmojiFont,"Apple Color Emoji","Segoe UI Emoji",NotoColorEmoji,"Segoe UI Symbol","Android Emoji",EmojiSymbols; font-size:16px; border:0px solid rgb(217,217,227)">
Maysam</p>
<br>
<p></p>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="x_divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> slurm-users &lt;slurm-users-bounces@lists.schedmd.com&gt; on behalf of Diego Zuccato &lt;diego.zuccato@unibo.it&gt;<br>
<b>Sent:</b> Wednesday, June 28, 2023 4:15:44 PM<br>
<b>To:</b> slurm-users@lists.schedmd.com<br>
<b>Subject:</b> Re: [slurm-users] Dynamic Node Shrinking/Expanding for Running Jobs in Slurm</font>
<div> </div>
</div>
</div>
<font size="2"><span style="font-size:10pt;">
<div class="PlainText">IIUC it's not possible to increase resource usage once the job is <br>
started: it would mess up the scheduler and MPI comms (probably).<br>
<br>
But I also think you're trying to find a problem for a "solution". Just <br>
state the problem you're facing instead of proposing a solution :)<br>
What software are you running? How does it detect that a resize is <br>
needed? How would it handle the expansion?<br>
<br>
Diego<br>
<br>
On 28/06/2023 13:02, Rahmanpour Koushki, Maysam wrote:<br>
> Dear Slurm Mailing List,<br>
> <br>
> <br>
> I hope this email finds you well. I am currently working on a project <br>
> that requires the ability to dynamically shrink or expand nodes for <br>
> running jobs in Slurm. However, I am facing some challenges and would <br>
> greatly appreciate your assistance and expertise in finding a solution.<br>
> <br>
> In my research, I came across the following resources:<br>
> <br>
> 1. Slurm Advanced Usage Tutorial: I found a tutorial<br>
> (<a href="https://slurm.schedmd.com/slurm_ug_2011/Advanced_Usage_Tutorial.pdf">https://slurm.schedmd.com/slurm_ug_2011/Advanced_Usage_Tutorial.pdf</a>)<br>
> that discusses advanced features of Slurm. It mentions the possibility of assigning and deassigning<br>
> nodes to a job, which is exactly what I need. However, the tutorial refers to the FAQ for more detailed information.<br>
> <br>
> 2. Stack Overflow Question: I also came across a related question on<br>
> Stack Overflow<br>
> (<a href="https://stackoverflow.com/questions/49398201/how-to-update-job-node-number-in-slurm">https://stackoverflow.com/questions/49398201/how-to-update-job-node-number-in-slurm</a>)<br>
> that discusses updating the node number for a job in Slurm. The answer suggests that it is indeed possible, but again, it refers to the FAQ for further details.<br>
> <br>
> Upon reviewing the current FAQ, I found that it states node shrinking is <br>
> only possible for pending jobs. Unfortunately, it does not provide <br>
> additional information or examples to clarify if this functionality can <br>
> be extended to running jobs.<br>
> <br>
> I would be grateful if anyone could provide insight into the following:<br>
> <br>
> 1. Is it possible to dynamically shrink or expand nodes for running<br>
> jobs in Slurm? If so, how can it be achieved?<br>
> <br>
> 2. Are there any alternative methods or workarounds to accomplish<br>
> dynamic node scaling for running jobs in Slurm?<br>
> <br>
> I kindly request your guidance, personal experiences, or any relevant <br>
> resources that could shed light on this topic. Your expertise and <br>
> assistance would greatly help me in successfully completing my project.<br>
> <br>
> Thank you in advance for your time and support.<br>
> <br>
> Best regards,<br>
> <br>
> <br>
> Maysam<br>
> <br>
> <br>
> Johannes Gutenberg University of Mainz<br>
> <br>
> <br>
<br>
-- <br>
Diego Zuccato<br>
DIFA - Dip. di Fisica e Astronomia<br>
Servizi Informatici<br>
Alma Mater Studiorum - Università di Bologna<br>
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy<br>
tel.: +39 051 20 95786<br>
<br>
</div>
</span></font>
</body>
</html>