[slurm-users] ReqGRES value is not valid

Uemoto, Tomoki fj2770fj at aa.jp.fujitsu.com
Fri Oct 4 05:49:18 UTC 2019


Thank you for your reply

I'm sorry there were some mistakes.

# srun --version
slurm 18.08.6
#

$ cat gresgpu.sh
#!/bin/bash

#SBATCH -J gresgpu            # Job name
#SBATCH --gres=gpu:2
#SBATCH -o job.%j.out         # Name of stdout output file (%j expands to jobId)

srun sleep 60
$

o gres.conf
  Name=gpu File=/dev/tty0 CPUs=0,1
  Name=gpu File=/dev/tty1 CPUs=25,26

o slurm.conf
  TaskPlugin=task/affinity,task/cgroup
 GresTypes=gpu
 NodeName=sl02-c00[1-2] Gres=gpu:2

o cgroup.conf
  CgroupAutomount=yes
  ConstrainCores=yes
  ConstrainDevices=yes

$ sbatch gresgpu.sh
Submitted batch job 76

$ squeue -l
Fri Oct  4 14:43:15 2019
             JOBID PARTITION     NAME     USER    STATE       TIME TIME_LIMI  NODES NODELIST(REASON)
                76    normal  gresgpu     test  RUNNING       0:01 1-00:00:00      1 c001

$ sacct -j 76 -a -X -o jobid,partition,ReqGRES,AllocGRES,stat
       JobID  Partition      ReqGRES    AllocGRES      State
------------ ---------- ------------ ------------ ----------
76               normal        gpu:0        gpu:2    RUNNING
$

Regards, 
Tomo



-----Original Message-----
From: slurm-users [mailto:slurm-users-bounces at lists.schedmd.com] On Behalf Of Chris Samuel
Sent: Friday, October 04, 2019 2:38 PM
To: slurm-users at lists.schedmd.com
Subject: Re: [slurm-users] ReqGRES value is not valid

On 3/10/19 10:23 pm, Uemoto, Tomoki wrote:

> I don't know why it return value of ReqGres is 0.

Which version of Slurm are you on?

Also there looks to be a typo, you've got "prun" not "srun" in your batch script.

All the best,
Chris
--
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA



More information about the slurm-users mailing list