Hi Chris,
Thanks for confirming that UnkillableStepTimeout can have larger values without issues. Do you have some suggestions for values that would safely cover network filesystem delays?
Best regards, Ole
On 10/24/24 07:51, Christopher Samuel via slurm-users wrote:
Some time ago it was recommended that UnkillableStepTimeout values above 127 (or 256?) should not be used, see https://support.schedmd.com/ show_bug.cgi?id=11103. I don't know if this restriction is still valid with recent versions of Slurm?
As I read it that last comment includes a commit message for the fix to that problem, and we happily use a much longer timeout than that without apparent issue.