[slurm-users] Running two multiprocessing jobs in one sbatch
Brian Andrus
toomuchit at gmail.com
Sat Jul 25 20:29:59 UTC 2020
Is there a reason to run them as a single job?
It may be easier to just have 2 separate jobs of 16 cores each.
If there are dependency requirements, that is addressed by adding any
dependencies to the job submission.
Brian Andrus
On 7/25/2020 2:50 AM, Даниил Вахрамеев wrote:
> Hi everyone!
>
> I have SLURM cluster with several nodes with 16 vcpus per node. I've
> tried to run the following code:
>
> |#SBATCH --nodes 2 #SBATCH --ntasks 2 #SBATCH -c 16 srun --exclusive
> --nodes=1 program1 & srun --exclusive --nodes=1 program2 & wait |
>
> |program1| and |program2| needs 16cpus each and I expected that 2
> nodes with 32 cores would be allocated and |program1| would be ran on
> the first node and |program2| on the second one, but I got the
> following error message:
>
> |srun: error: Unable to create step for job 364966: Requested node
> configuration is not available |
>
> If I use only |--nodes| and |--ntasks| keys, sbatch allocates 2 nodes
> with 2 cpus and if I use |--nodes| and |-c| options, I get message
> that |--ntasks| should be defined.
>
> If I set |--ntasks=1|, SLURM set nnodes to 1.
>
> How can I run this two programs in one batch, each on one node and 16
> vcpus?
>
> ------
>
> Kind regards,
>
> Daniil Vakhrameev
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200725/f56d96bb/attachment.htm>
More information about the slurm-users
mailing list