[slurm-users] Splitting mpi rank output
    Chris Samuel 
    chris at csamuel.org
       
    Thu May 10 01:40:52 MDT 2018
    
    
  
On Thursday, 10 May 2018 2:25:49 AM AEST Christopher Benjamin Coffey wrote:
> I have a user trying to use %t to split the mpi rank outputs into different
> files and it's not working. I verified this too. Any idea why this might
> be? This is the first that I've heard of a user trying to do this.
I think they want to use that as an argument to srun, not sbatch.
I don't know why it doesn't work for sbatch, I'm guessing it doesn't get 
passed on in the environment?  From the look of the srun manual page it 
probably should set SLURM_STDOUTMODE.  But then you'd get both the batch 
output and rank 0 going to the first one.  Seems like a bug to me.
However, I can confirm that it works if you pass it to srun instead.
[csamuel at farnarkle1 tmp]$ cat test-rank.sh
#!/bin/bash
#SBATCH --ntasks=10
#SBATCH --ntasks-per-node=1
srun -o foo-%t.out hostname
[csamuel at farnarkle1 tmp]$ ls -ltr
total 264
-rw-rw-r-- 1 csamuel hpcadmin 89 May 10 17:34 test-rank.sh
-rw-rw-r-- 1 csamuel hpcadmin  0 May 10 17:34 slurm-127420.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-9.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-8.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-7.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-6.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-5.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-4.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-3.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-2.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-1.out
-rw-rw-r-- 1 csamuel hpcadmin  7 May 10 17:34 foo-0.out
[csamuel at farnarkle1 tmp]$ more foo-*
::::::::::::::
foo-0.out
::::::::::::::
john37
::::::::::::::
foo-1.out
::::::::::::::
john38
::::::::::::::
foo-2.out
::::::::::::::
john39
::::::::::::::
foo-3.out
::::::::::::::
john40
::::::::::::::
foo-4.out
::::::::::::::
john41
::::::::::::::
foo-5.out
::::::::::::::
john42
::::::::::::::
foo-6.out
::::::::::::::
john43
::::::::::::::
foo-7.out
::::::::::::::
john44
::::::::::::::
foo-8.out
::::::::::::::
john45
::::::::::::::
foo-9.out
::::::::::::::
john46
Hope that helps,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC
    
    
More information about the slurm-users
mailing list