[slurm-users] job_container/tmpfs and autofs

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Thu Jan 12 08:29:48 UTC 2023


Hi Magnus,

We had the same challenge some time ago.  A long description of possible 
solutions is on my Wiki page at 
https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_configuration/#temporary-job-directories

The issue may have been solved by the fix tracked in 
https://bugs.schedmd.com/show_bug.cgi?id=12567, which will be in Slurm 23.02.
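
For reference, here is a minimal sketch of the configuration involved. 
JobContainerType, PrologFlags, AutoBasePath and BasePath are the documented 
parameters for the plugin. The BasePath value is only a placeholder, and the 
Shared line is my assumption about how the 23.02 fix is switched on, so 
please check the job_container.conf man page of your release before relying 
on it:

```
# slurm.conf
JobContainerType=job_container/tmpfs
PrologFlags=Contain

# job_container.conf
AutoBasePath=true
# Placeholder path: use a node-local file system at your site
BasePath=/scratch/slurm_containers
# Assumption: 23.02 option that propagates new mounts (e.g. from autofs)
# into the job's namespace; not available in earlier releases
Shared=true
```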

At this time, the auto_tmpdir SPANK plugin seems to be the best solution.

IHTH,
Ole

On 1/12/23 08:49, Hagdorn, Magnus Karl Moritz wrote:
> Hi there,
> we excitedly found the job_container/tmpfs plugin which neatly allows
> us to provide local scratch space and a way of ensuring that /dev/shm
> gets cleaned up after a job finishes. Unfortunately we found that it
> does not play nicely with autofs which we use to provide networked
> project and scratch directories. We found that this is a known issue
> [1]. I was wondering if that has been solved? I think it would be
> really useful to have a warning about this issue in the documentation
> for the job_container/tmpfs plugin.
> Regards
> magnus
> 
> [1]
> https://cernvm-forum.cern.ch/t/intermittent-client-failures-too-many-levels-of-symbolic-links/156/4

-- 
Ole Holm Nielsen
PhD, Senior HPC Officer
Department of Physics, Technical University of Denmark


