Re: slurmctld HA ; backup controller doesn't schedule and start any job
by Hiromasa Watanabe 10 Apr '25
slurmctld HA ; backup controller doesn't schedule and start any job
by hiromasa.watanabe@gmail.com 09 Apr '25
Run a command in Slurm with all streams and signals connected to the submitting command
by Michael Milton 06 Apr '25
Re: Run a command in Slurm with all streams and signals connected to the submitting command
by Michael Milton 04 Apr '25
bit_cache_init failure on the second time backup controller tries to take control
by Safdar Iqbal 27 Mar '25
allowing spyder-kernels to an interactive session without pam_slurm_adopt and older version of Slurm from OpenHPC repo; paramiko?
by Robert Kudyba 06 Mar '25
broken SLURM-PMIX out-of-band communication on v24.11.0 with PMIx v5
by Bertini, Denis Dr. 06 Mar '25
Re: how to set slurmdbd.conf if using two slurmdb node with HA database?
by taleintervenor@sjtu.edu.cn 27 Feb '25
Please help [CPUs=24 Boards=1 SocketsPerBoard=1 CoresPerSocket=16 ThreadsPerCore=1]
by Hugo Solís 23 Feb '25
how to set slurmdbd.conf if using two slurmdb node with HA database?
by taleintervenor@sjtu.edu.cn 19 Feb '25
Behavior of 'afterok' in cloud clusters
by Thompson, Hoot (GSFC-606.0)[ADNET SYSTEMS INC] 30 Jan '25
Nodes required for job are DOWN, DRAINED or reserved for jobs in higher priority partitions
by sportlecon sportlecon 07 Jan '25
Node configuration unavailable when using --mem-per-gpu , for specific GPU type
by Matthew R. Baney 13 Dec '24
Why is my job killed when ResumeTimeout is reached instead of it being requeued?
by Xaver Stiensmeier 09 Dec '24
How can I make sure my user have only one job per node (Job array --exclusive=user,)
by Oren 03 Dec '24
error: Unable to contact slurm controller (connect failure)
by Daniel Rodriguez Lopez (ext) 19 Nov '24
How to power up all ~idle nodes and verify that they have started up without issue programmatically
by Xaver Stiensmeier 15 Nov '24
Re: Fwd: What is the safe upgrade path when upgrade from slurm21.08 and mariadb5.5?
by taleintervenor@sjtu.edu.cn 29 Oct '24
Fwd: What is the safe upgrade path when upgrade from slurm21.08 and mariadb5.5?
by taleintervenor@sjtu.edu.cn 29 Oct '24
Tracking costs - one single pool of credits, variable costs per partition
by John Snowdon 18 Oct '24
Slurmctld process error 'double free or corruption' on RHEL 9 (Rocky Linux)
by William VINCENT 07 Oct '24
Updated one compute node to Ubuntu 24.04 LTS, now it does not receive jobs
by Cristóbal Navarro 29 Sep '24
errors compiling Slurm 18 on RHEL 9: [Makefile:577: scancel] Error 1 & It's not recommended to have unversioned Obsoletes
by Robert Kudyba 27 Sep '24
Can't schedule on cloud node: State=IDLE+CLOUD+POWERED_DOWN+NOT_RESPONDING
by Xaver Stiensmeier 20 Sep '24
Best practices for tracking jobs started across multiple clusters for accounting purposes.
by Di Bernardini, Fabio 02 Sep '24