[slurm-users] Nodes not returning from DRAINING
Christopher Samuel
chris at csamuel.org
Wed Oct 28 18:58:45 UTC 2020
On 10/28/20 6:27 am, Diego Zuccato wrote:
> Strangely the core file seems corrupted (maybe because it's from a
> 4-nodes job and they all try to write to the same file?):
You can set a pattern for core file names (the kernel.core_pattern
sysctl) to prevent that. Usually the PID is in the name, but you can
put the hostname in there too with the %h specifier.
https://man7.org/linux/man-pages/man5/core.5.html
See the section: "Naming of core dump files"
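
As a rough illustration of why the hostname helps, here is a small
Python sketch that mimics how a few of the specifiers from core(5)
(%h, %p, %e and %%) are expanded. It is not the kernel's code, only a
simplified model of the documented behaviour. The function name
expand_core_pattern, the node names, the PID and the executable name
are all made up for the example.

```python
def expand_core_pattern(pattern, hostname, pid, comm):
    """Expand a subset of the core(5) specifiers: %h, %p, %e and %%.

    Simplified model for illustration only; the real expansion is done
    by the kernel and supports more specifiers than are handled here.
    """
    values = {"h": hostname, "p": str(pid), "e": comm, "%": "%"}
    out = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "%" and i + 1 < len(pattern):
            # Specifiers outside this subset are dropped in this sketch.
            out.append(values.get(pattern[i + 1], ""))
            i += 2
        elif ch == "%":
            # A single trailing % is dropped from the file name.
            i += 1
        else:
            out.append(ch)
            i += 1
    return "".join(out)


# Hypothetical 4-node job where every rank happens to get the same PID.
hosts = ["node01", "node02", "node03", "node04"]
pattern = "core.%h.%e.%p"
names = [expand_core_pattern(pattern, h, 4242, "a.out") for h in hosts]
for name in names:
    print(name)
```

With only the PID in the pattern (for example "core.%p"), two nodes
whose processes share a PID would write to the same file on a shared
filesystem. With %h included, each node gets its own file name.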
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA