[slurm-users] Problems calling mpirun in OpenMPI-3.1.6 + slurm and OpenMPI-4.0.3+slurm environments

Jeffrey T Frey frey at udel.edu
Fri Apr 10 17:00:57 UTC 2020


I just reread your post -- you installed Open MPI 4.0.3 to /home/manumachu/openmpi-4.0.3/OPENMPI_INSTALL then set what's probably a different directory -- /scratch/manumachu/openmpi-4.0.3/OPENMPI_INSTALL/bin -- on your path.  So I bet "which mpirun" won't show you what you're expecting :-)




> On Apr 10, 2020, at 12:59 , Jeffrey T Frey <frey at udel.edu> wrote:
> 
> Are you certain you're PATH addition is correct?  The "-np" flag is still present in a build of Open MPI 4.0.3 I just made, in fact:
> 
> 
> $ 4.0.3/bin/mpirun 
> --------------------------------------------------------------------------
> mpirun could not find anything to do.
> 
> It is possible that you forgot to specify how many processes to run
> via the "-np" argument.
> --------------------------------------------------------------------------
> 
> 
> Note that with the Slurm plugins present in your Open MPI build, there should be no need to use "-np" on the command line; the Slurm RAS plugin should pull such information from the Slurm runtime environment variables.  If you do use "-np" to request more CPUs that the job was allocated, you'll receive oversubscription errors (you know, unless you include mpirun flags to allow that to happen).
> 
> 
> What if you add "which mpirun" to your job script ahead of the "mpirun" command -- does it show you /scratch/manumachu/openmpi-4.0.3/OPENMPI_INSTALL/bin/mpirun?
> 
> 
> 
> 
>> On Apr 10, 2020, at 12:12 , Ravi Reddy Manumachu <ravi.manumachu at ucd.ie> wrote:
>> 
>> 
>> Dear Slurm Users,
>> 
>> I am facing issues with the following combinations of OpenMPI and SLURM. I was wondering if you have faced something similar and can help me.
>> 
>> OpenMPI-3.1.6 and slurm 19.05.5
>> OpenMPI-4.0.3 and slurm 19.05.5
>> 
>> I have the OpenMPI packages configured with "--with-slurm" option and installed. 
>> 
>>   Configure command line: '--prefix=/home/manumachu/openmpi-4.0.3/OPENMPI_INSTALL' '--with-slurm'
>>                  MCA ess: slurm (MCA v2.1.0, API v3.0.0, Component v4.0.3)
>>                  MCA plm: slurm (MCA v2.1.0, API v2.0.0, Component v4.0.3)
>>                  MCA ras: slurm (MCA v2.1.0, API v2.0.0, Component v4.0.3)
>>               MCA schizo: slurm (MCA v2.1.0, API v1.0.0, Component v4.0.3)
>> 
>> I am executing the sbatch script shown below:
>> 
>> #!/bin/bash
>> #SBATCH --account=xxxxx
>> #SBATCH --job-name=ompi4
>> #SBATCH --output=ompi4.out
>> #SBATCH --error=ompi4.err
>> #SBATCH --ntasks-per-node=1
>> #SBATCH --time=00:30:00
>> export PATH=/scratch/manumachu/openmpi-4.0.3/OPENMPI_INSTALL/bin:$PATH
>> export LD_LIBRARY_PATH=/scratch/manumachu/openmpi-4.0.3/OPENMPI_INSTALL/lib:$LD_LIBRARY_PATH
>> mpirun -np 4 ./bcast_timing -t 1
>> 
>> No matter what option I give to mpirun, I get the following error:
>> mpirun: Error: unknown option "-np"
>> 
>> I have used mpiexec also but received the same errors.
>> 
>> To summarize, I am not able to call mpirun from a SLURM script. I can use srun but I have no idea how to pass MCA parameters I usually give to mpirun such as, "--map-by ppr:1:socket -mca pml ob1 -mca btl tcp,self -mca coll_tuned_use_dynamic_rules 1".
>> 
>> Thank you for your help.
>> 
>> -- 
>> Kind Regards
>> Dr. Ravi Reddy Manumachu
>> Research Fellow, School of Computer Science, University College Dublin
>> Ravi Manumachu on Google Scholar, ResearchGate
> 




More information about the slurm-users mailing list