[slurm-users] 17.11+auks+cgroups: finished jobs hang in completing state

Robbert Eggermont R.Eggermont at tudelft.nl
Mon Mar 26 03:50:21 MDT 2018


FYI:

I think we've run into this issue: 
https://github.com/hautreux/auks/issues/24

It seems to be triggered by a change in signal blocking in slurmstepd:
https://github.com/SchedMD/slurm/commit/d2c83807097605f10f0b19cf2c5cb5c2c6f35ad6

The suggest fix (use sigkill instead of sigterm in slurm_spank_auks to 
stop auks) seems to work (so far).

-- 
Robbert Eggermont
Intelligent Systems Support & Data Steward | TU Delft
+31 15 27 83234 | Building 28, Floor 5, Room W660
Available Mon, Wed-Fri



More information about the slurm-users mailing list