<html><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div class="">• <b class="">GraceTime</b>: Specifies a time period for a job to execute after it is selected to be preempted. This option can be specified by partition or QOS using the slurm.conf file or database respectively. This option is only honored if PreemptMode=CANCEL. The GraceTime is specified in seconds and the default value is zero, which results in no preemption delay. Once a job has been selected for preemption, its end time is set to the current time plus GraceTime. The job is immediately sent SIGCONT and SIGTERM signals in order to provide notification of its imminent termination. This is followed by the SIGCONT, SIGTERM and SIGKILL signal sequence upon reaching its new end time.</div><div class=""><br class=""></div><div class=""><br class=""></div><div class="">"The job is immediately sent SIGCONT and SIGTERM signals in order to provide notification of its imminent termination."</div><div class=""><br class=""></div><div class=""><br class=""></div><div class="">Default behavior on SIGTERM is for a program to exit; your program is probably ending when it receives that initial SIGTERM.</div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><blockquote type="cite" class="">On Nov 20, 2017, at 10:21 AM, Ailing Zhang <<a href="mailto:zhangal1992@gmail.com" class="">zhangal1992@gmail.com</a>> wrote:<br class=""><br class=""><br class="">Hi slurm community,<br class=""><br class="">I'm testing preemption with partition based preemption. Partitions test-high and test-low share the same nodes. I set GraceTime=600 and PreemptMode=CANCEL in test-low. But once I submitted a job to test-high, job in test-low is immediately killed without any grace time. <br class="">Here is my configs.<br class="">PartitionName=test-low<br class=""> AllowGroups=admins AllowAccounts=ALL AllowQos=ALL<br class=""> AllocNodes=ALL Default=NO QoS=N/A<br class=""> DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=600 Hidden=NO<br class=""> MaxNodes=UNLIMITED MaxTime=02:00:00 MinNodes=1 LLN=NO MaxCPUsPerNode=UNLIMITED<br class=""> Nodes=node[100-102]<br class=""> PriorityJobFactor=10 PriorityTier=10 RootOnly=NO ReqResv=NO OverSubscribe=NO<br class=""> OverTimeLimit=NONE PreemptMode=CANCEL<br class=""> State=UP TotalCPUs=100 TotalNodes=3 SelectTypeParameters=NONE<br class=""> DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED<br class=""><br class="">PartitionName=test-high<br class=""> AllowGroups=admins AllowAccounts=ALL AllowQos=ALL<br class=""> AllocNodes=ALL Default=NO QoS=N/A<br class=""> DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=0 Hidden=NO<br class=""> MaxNodes=UNLIMITED MaxTime=02:00:00 MinNodes=1 LLN=NO MaxCPUsPerNode=UNLIMITED<br class=""> Nodes=node[100-102] PriorityJobFactor=30 PriorityTier=30 RootOnly=NO ReqResv=NO OverSubscribe=NO<br class=""> OverTimeLimit=NONE PreemptMode=OFF<br class=""> State=UP TotalCPUs=100 TotalNodes=3 SelectTypeParameters=NONE<br class=""> DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED<br class=""><br class="">Any help will be much appreciated.<br class=""><br class="">Thanks!<br class="">Ailing<br class=""></blockquote><br class=""><div class=""><br class="">::::::::::::::::::::::::::::::::::::::::::::::::::::::<br class="">Jeffrey T. Frey, Ph.D.<br class="">Systems Programmer V / HPC Management<br class="">Network & Systems Services / College of Engineering<br class="">University of Delaware, Newark DE 19716<br class="">Office: (302) 831-6034 Mobile: (302) 419-4976<br class="">::::::::::::::::::::::::::::::::::::::::::::::::::::::<br class=""><br class=""><br class=""><br class=""></div><br class=""></body></html>