[slurm-users] Execute scripts on suspend and cancel

Goetz, Patrick G pgoetz at math.utexas.edu
Thu Oct 17 16:50:29 UTC 2019


Are applications even aware when they've been hit by a SIGSTP?  This 
idea of a license being released under these circumstances just seems 
very unlikely.

On 10/15/19 1:57 PM, Brian Andrus wrote:
> It seems that there are some details that would need addressed.
> 
> A suspend signal is nothing more than sending a SIGSTP (like hitting 
> ctrl-s), so the application is still in memory awaiting SIGCONT
> 
> So what should happen when it continues and there are no more licenses? 
> So the proper place for what you are looking for is in the application 
> itself. If it is given a SIGSTP, it could release the licenses and then 
> check them out again when SIGCONT is received.
> 
> If you are able to tell your app to release/request a license 
> externally, you may want to have a wrapper to do the signal handling 
> until they have it as part of their app.
> 
> Brian Andrus
> 
> 
> On 10/14/2019 4:40 AM, Oytun Peksel wrote:
>>
>> It is quite weird if slurm has no mechanism as described. I have been 
>> digging more into it and someone suggested a workaround using mail 
>> notifications. You use a script instead of the mail application and 
>> catch the event then use use sacct to see what is happening.
>>
>> Two problems with this:
>>
>> ·There is no mail sent with suspended preemption
>>
>> ·If you use requeue instead there will be a mail event and you can 
>> catch it. Sacct will flag it as “preempted” so you know it is 
>> requeued. But then it would change it pending. So you really need to 
>> be quick to catch it. Also there is no distinctive flag for resuming.
>>
>> Anyone has any other method to execute scripts during preemption?
>>
>>
>>
>> *Oytun Peksel*
>>
>> oytun.peksel at semcon.com <mailto:oytun.peksel at semcon.com>
>>
>> 	
>> 	
>>
>> +46739205917
>>
>> 	
>> 	
>>
>> *From:*slurm-users <slurm-users-bounces at lists.schedmd.com> *On Behalf 
>> Of *Oytun Peksel
>> *Sent:* den 11 oktober 2019 09:10
>> *To:* slurm-users at lists.schedmd.com
>> *Subject:* [slurm-users] Execute scripts on suspend and cancel
>>
>> Hi,
>>
>> I was wondering is there an option in Slurm to execute custom scripts 
>> before Suspend signal.  What I need to do is to tell an application to 
>> release it’s licenses before sending the suspend signal during 
>> preemption. I think went through all the documentation but could not 
>> find a mechanism like this.
>>
>> BR
>>
>> /Oytun
>>
>>
>>
>> /When you communicate with us or otherwise interact with Semcon, we 
>> will process personal data that you provide to us or we collect about 
>> you, please read more in our Privacy Policy 
>> <https://semcon.com/data-privacy-policy/>./
>>



More information about the slurm-users mailing list