[slurm-users] scancel problem

sysadmin.caos sysadmin.caos at uab.cat
Fri Sep 21 05:49:34 MDT 2018


Try running with "srun", not "mpirun"

> Hello everybody,
> i submit a job with sbatch command (sbatch myprog.sh).  My prog.sh is
> =============================================================
> #!/bin/bash
> #SBATCH --partition=part2
> #SBATCH --ntasks=20
> #SBATCH --nodelist=
> #SBATCH --cpus-per-task=1
> #SBATCH --mem=
> # Memory per node specification is in MB. It is optional.
> # The default limit is 3000MB per core.
> #SBATCH --job-name="test"
> #SBATCH --output=test.output
> #SBATCH --mail-user=test at out.gr
> #SBATCH --mail-type=ALL
>
> mpirun -c 20 /home/me/projects/EXP00/opa
> =============================================================
>
> The submmited id is 5402. When i  cancel the job by the command  "scancel
> 5402"  i notice that the job is deleted from the squeue ( the job is not
> shown in squeue) but making an htop at the node where it was running i see
> that it continues to be running....
> Moreover, another user submiited his job, which was allocated at the same
> node ....  The node has 20 cores...
> What is happenning here?
>
> Slurm Version slurm 16.05.9




More information about the slurm-users mailing list