[slurm-users] [External] How can I prevent a specific job from being preempted?
Brian Andrus
toomuchit at gmail.com
Thu Sep 16 16:19:58 UTC 2021
Modify it and raise the priority to something very, very high.
scontrol update job=JOBID priority=9999999
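For anyone scripting this, here is a minimal sketch that only prints the command so it can be reviewed first. The helper name priority_cmd and the job ID 12345 are made up for illustration. Changing a job's priority normally needs operator or admin rights, and whether a high priority actually protects a running job depends on how preemption is configured on the cluster (PreemptType in slurm.conf).

```shell
#!/bin/sh
# Hypothetical helper: print the scontrol command that raises a job's priority.
# It only builds the command line; nothing is sent to Slurm here.
priority_cmd() {
    jobid=$1
    prio=${2:-9999999}   # default taken from the suggestion above
    case $jobid in
        ''|*[!0-9]*)
            echo "job ID must be numeric: '$jobid'" >&2
            return 1
            ;;
    esac
    echo "scontrol update job=$jobid priority=$prio"
}

# Example with a made-up job ID; review the output, then run it as a Slurm admin.
priority_cmd 12345
```

Afterwards, scontrol show job 12345 should list the new value in its Priority field.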
Brian Andrus
On 9/16/2021 8:39 AM, 顏文 wrote:
> Dear users
>
> Thanks for the prompt replies. I currently have one important job
> running. How can I prevent this running job from being preempted? It
> has been running for two weeks, and I am afraid it will be terminated
> in the next few days. Can we do anything to protect the job from being
> cancelled or preempted? Thanks.
>
> regards,
> Peter
>
> Russell Jones <arjones85 at gmail.com> wrote on Tue, Sep 14, 2021 at
> 10:52 PM:
>
> The other option is creating a "special" partition that only this
> user(s) can submit to, where jobs running in that partition have a
> higher priority than all the others (if you are using partition
> priority like we are).
>
> On Tue, Sep 14, 2021 at 3:26 AM Loris Bennett
> <loris.bennett at fu-berlin.de> wrote:
>
> Dear Peter,
>
> 顏文 <peteryen2017 at gmail.com> writes:
>
> > Dear Mr. Zillner
> >
> > I would like this specific running job not to be rescheduled, and
> > also not to be terminated or cancelled in any way. If the job is
> > cancelled, I need to start it over again. Jobs of this kind normally
> > take weeks to finish, so the cost of restarting is significant. Can
> > your suggestions also achieve that? Thanks.
>
> I have not tried this, but could you define a QOS which cannot be
> preempted by the other QOSs? That way you can also restrict access to
> the QOS, since a non-preemptable QOS will probably be more attractive
> than a preemptable one. You would then obviously have to explain to the
> users which QOS they should use.
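A rough, untested sketch of what that could look like with sacctmgr. The QOS name protected, the user name peter and the script long_job.sh are placeholders. It assumes PreemptType=preempt/qos is set in slurm.conf, and that protected is not listed in the Preempt field of any other QOS, which is what would make it non-preemptable. By default the commands are only printed; set DRY_RUN=0 to really run them.

```shell
#!/bin/sh
# Dry-run wrapper: prints each command unless DRY_RUN=0 is set.
run() {
    if [ "${DRY_RUN:-1}" = "0" ]; then
        "$@"
    else
        echo "+ $*"
    fi
}

QOS=protected      # hypothetical QOS name
USER_NAME=peter    # hypothetical user allowed to use it

# Create the QOS; it stays non-preemptable as long as no other QOS
# names it in its Preempt list.
run sacctmgr -i add qos "$QOS"

# Restrict access by granting the QOS only to selected users.
run sacctmgr -i modify user "$USER_NAME" set qos+="$QOS"

# Check which QOS can preempt which.
run sacctmgr show qos format=Name,Preempt,PreemptMode

# The user then submits with the new QOS (script name is a placeholder).
run sbatch --qos="$QOS" long_job.sh
```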
>
> Cheers,
>
> Loris
>
> > with regards,
> > Peter
> >
> > Florian Zillner <fzillner at lenovo.com> wrote on Tue, Sep 14, 2021
> > at 3:40 PM:
> >
> > See the no-requeue option for SBATCH:
> >
> > --no-requeue
> > Specifies that the batch job should never be requeued under any
> > circumstances. Setting this option will prevent system
> > administrators from being able to restart the job (for example,
> > after a scheduled downtime), recover from a node failure, or be
> > requeued upon preemption by a higher priority job. When a job is
> > requeued, the batch script is initiated from its beginning. Also see
> > the --requeue option. The JobRequeue configuration parameter
> > controls the default behavior on the cluster.
> >
> > https://slurm.schedmd.com/sbatch.html
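As a small illustration of the option quoted above, the sketch below writes a batch script carrying the --no-requeue directive and checks that it is there. The file name long_job.sh, the job name and the ./my_simulation payload are placeholders. Note that the option only stops the job from being requeued; on its own it does not stop the job from being preempted or cancelled.

```shell
#!/bin/sh
# Write a small batch script (hypothetical name and payload) that opts
# out of requeueing.
cat > long_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=long_job
#SBATCH --no-requeue
srun ./my_simulation
EOF

# Confirm the directive is present before submitting (counts matching lines).
grep -c -- '--no-requeue' long_job.sh

# On a cluster, submit with: sbatch long_job.sh
```

The same flag can also be given on the command line, as in sbatch --no-requeue long_job.sh.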
> >
> --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
> > From: slurm-users <slurm-users-bounces at lists.schedmd.com> on
> > behalf of 顏文 <peteryen2017 at gmail.com>
> > Sent: Tuesday, September 14, 2021 7:02:13 AM
> > To: slurm-users at lists.schedmd.com
> > Subject: [External] [slurm-users] How can I prevent a specific job
> > from being preempted?
> >
> > Dear slurm users,
> >
> > I have some specific jobs that must not be terminated, because
> > otherwise they need to be rerun from the beginning. Can we simply
> > apply some settings (either as user or as administrator) so that
> > these jobs will not be preempted? Thanks.
> >
> > with regards,
> > Peter
>