[slurm-users] Cannot run interactive jobs

Sajesh Singh ssingh at amnh.org
Wed Mar 25 06:21:50 UTC 2020


CentOS 7.7.1908
Slurm 18.08.8

When trying to run an interactive job I am getting the following error:

srun: error: task 0 launch failed: Slurmd could not connect IO

The slurmd log file on the compute node shows the following errors:

[2020-03-25T01:42:08.262] launch task 13.0 request from UID:1326 GID:50000 HOST:192.168.229.254 PORT:14980
[2020-03-25T01:42:08.262] lllp_distribution jobid [13] implicit auto binding: cores,one_thread, dist 8192
[2020-03-25T01:42:08.262] _task_layout_lllp_cyclic
[2020-03-25T01:42:08.262] _lllp_generate_cpu_bind jobid [13]: mask_cpu,one_thread, 0x0000000000000001
[2020-03-25T01:42:08.262] _run_prolog: run job script took usec=5
[2020-03-25T01:42:08.262] _run_prolog: prolog with lock for job 13 ran for 0 seconds
[2020-03-25T01:42:08.272] [13.0] Considering each NUMA node as a socket
[2020-03-25T01:42:08.310] [13.0] error: stdin openpty: Operation not permitted
[2020-03-25T01:42:08.311] [13.0] error: IO setup failed: Operation not permitted
[2020-03-25T01:42:08.311] [13.0] error: job_manager exiting abnormally, rc = 4021
[2020-03-25T01:42:08.315] [13.0] done with job
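
The first error in the log is the failed openpty call, so one way to narrow this down is to check whether a pseudo-terminal can be allocated on the compute node at all, outside of Slurm. The following is a minimal sketch, assuming Python is available on the node; the function name check_openpty is made up for illustration. It runs in a normal login session rather than in the slurmd/slurmstepd context, so a successful result is only a hint and does not rule out a problem specific to that context.

```python
import errno
import os


def check_openpty():
    """Try to allocate a pseudo-terminal pair.

    Returns (ok, message). Hypothetical helper for illustration only;
    it is not part of Slurm.
    """
    try:
        master_fd, slave_fd = os.openpty()
    except OSError as exc:
        # Report the symbolic errno (e.g. EPERM) alongside the text.
        code = errno.errorcode.get(exc.errno, str(exc.errno))
        return False, "openpty failed: {} ({})".format(exc.strerror, code)
    try:
        # Name of the slave side, typically something like /dev/pts/N.
        name = os.ttyname(slave_fd)
    except OSError:
        name = "<unnamed>"
    finally:
        os.close(master_fd)
        os.close(slave_fd)
    return True, "openpty ok: {}".format(name)


if __name__ == "__main__":
    ok, message = check_openpty()
    print(message)
```

Running this both as root and as the user who submitted the job (UID 1326 in the log above) would show whether the "Operation not permitted" result can be reproduced outside of Slurm.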

Running the same interactive job on a cluster with CentOS 7.3 and Slurm 18.08.4 works as expected.

Any advice on how to remedy this would be appreciated.

-Sajesh-



