[slurm-users] Checking allocated GRES? SLURM 16.05.x

Ryan Novosielski novosirj at rutgers.edu
Mon Feb 5 14:22:36 MST 2018


> On Feb 5, 2018, at 4:11 PM, Kilian Cavalotti <kilian.cavalotti.work at gmail.com> wrote:
> 
> Hi Ryan,
> 
> On Mon, Feb 5, 2018 at 8:06 AM, Ryan Novosielski <novosirj at rutgers.edu> wrote:
>> We currently use SLURM 16.05.10 and one of our staff asked how they
>> can check for allocated GPUs, as you might check allocated CPUs by
>> doing scontrol show node. I could have sworn that you can see both,
>> but I see that only CPUs is visible. One of our staff recommended
>> using sacct to see it. Is there a better way?
> 
>    scontrol -d show node <nodename>
> 
> will display a GresUsed line, that shows how many of those Gres are
> allocated, if that's what you're looking for. You'll get a count, but
> not individual ids, AFAIK.

Exactly what we were looking for, thank you.

PS: Anyone know what GresDrain is? Sounds like a feature we’ve been looking for but believed not to exist (offlining of some number of GRES). I’m assuming it’s there but not implemented?

[novosirj at perceval1 install-files]$ scontrol -d show node cuda001
NodeName=cuda001 Arch=x86_64 CoresPerSocket=12
   CPUAlloc=24 CPUErr=0 CPUTot=24 CPULoad=24.03
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=gpu:4
   GresDrain=N/A
   GresUsed=gpu:4
   NodeAddr=cuda001 NodeHostName=cuda001 Version=16.05
   OS=Linux RealMemory=128241 AllocMem=124000 FreeMem=105469 Sockets=2 Boards=1
   State=ALLOCATED ThreadsPerCore=1 TmpDisk=0 Weight=4 Owner=N/A MCS_label=N/A
   BootTime=2018-01-27T23:01:42 SlurmdStartTime=2018-01-28T00:17:45
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

--
____
|| \\UTGERS,  	 |---------------------------*O*---------------------------
||_// the State	 |         Ryan Novosielski - novosirj at rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
||  \\    of NJ	 | Office of Advanced Research Computing - MSB C630, Newark
     `'

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 236 bytes
Desc: Message signed with OpenPGP
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180205/90da6902/attachment.sig>


More information about the slurm-users mailing list