[slurm-users] Fwd: task/cgroup plugin causes "srun: error: task 0 launch failed: Plugin initialization failed" error on Ubuntu 22.04
Tim Schneider
tim.schneider1 at tu-darmstadt.de
Thu Jun 15 22:04:09 UTC 2023
Hi,
I am maintaining the SLURM cluster of my research group. Recently I
updated to Ubuntu 22.04 and Slurm 21.08.5 and ever since, I am unable to
launch jobs. When launching a job, I receive the following error:
/$ srun --nodes=1 --ntasks-per-node=1 -c 1 --mem-per-cpu 1G
--time=01:00:00 --pty -p amd -w cn02 --pty bash -i//
//srun: error: task 0 launch failed: Plugin initialization failed/
Strangely, I cannot find any indication of this problem in the logs
(find the logs attached). The problem must be related to the task/cgroup
plugin, as it does not occur when I disable it.
After reading in the documentation, I tried adding the
/cgroup_enable=memory swapaccount=1/ kernel parameters, but the problem
persisted.
I would be very grateful for any advice where to look since I have no
idea how to investigate this issue further.
Thanks a lot in advance.
Best,
Tim
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20230616/e3d56d72/attachment-0001.htm>
-------------- next part --------------
###
# Slurm cgroup support configuration file
###
CgroupAutomount=yes
CgroupMountpoint=/sys/fs/cgroup
ConstrainKmemSpace=no
ConstrainCores=yes
ConstrainRAMSpace=yes
ConstrainSwapSpace=yes
# This will be necessary for controlling GPU access
ConstrainDevices=yes
#
-------------- next part --------------
A non-text attachment was scrubbed...
Name: slurmd.log
Type: text/x-log
Size: 9194 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20230616/e3d56d72/attachment-0002.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: slurmctld.log
Type: text/x-log
Size: 56551 bytes
Desc: not available
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20230616/e3d56d72/attachment-0003.bin>
More information about the slurm-users
mailing list