[slurm-users] Heterogeneous GPU Node MPS

Holger Badorreck h.badorreck at lzh.de
Fri Nov 13 12:23:19 UTC 2020


Hello,

I have a heterogeneous GPU node with one V100 and two RTX cards. When I
request resources with --gres=mps:100, the V100 is always chosen, and jobs
wait once the V100 is fully allocated, even though the RTX cards are free.
If I use --gres=gpu:1, the RTX cards are used as well. Is something wrong
with the configuration, or is this a different problem?

The node configuration in slurm.conf:

NodeName=node1 CPUs=48 RealMemory=128530 Sockets=1 CoresPerSocket=24 ThreadsPerCore=2 Gres=gpu:v100:1,gpu:rtx:2,mps:600 State=UNKNOWN

gres.conf:

Name=gpu Type=v100 File=/dev/nvidia0
Name=gpu Type=rtx  File=/dev/nvidia1
Name=gpu Type=rtx  File=/dev/nvidia2
Name=mps Count=200 File=/dev/nvidia0
Name=mps Count=200 File=/dev/nvidia1
Name=mps Count=200 File=/dev/nvidia2
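
As a sanity check on the numbers, the three per-device mps counts (200 each) should add up to the mps:600 declared for the node. Below is a minimal Python sketch of that check, with the two configurations pasted in as strings. The helper names (parse_gres_conf, mps_per_device, node_mps_total) are hypothetical, and the script only parses text; it does not query Slurm.

```python
# Sketch only: checks that per-device MPS counts in gres.conf add up to the
# node-level mps total in slurm.conf. It parses text and does not talk to Slurm.

GRES_CONF = """\
Name=gpu Type=v100 File=/dev/nvidia0
Name=gpu Type=rtx  File=/dev/nvidia1
Name=gpu Type=rtx  File=/dev/nvidia2
Name=mps Count=200 File=/dev/nvidia0
Name=mps Count=200 File=/dev/nvidia1
Name=mps Count=200 File=/dev/nvidia2
"""

# The Gres= value from the NodeName line in slurm.conf.
NODE_GRES = "gpu:v100:1,gpu:rtx:2,mps:600"


def parse_gres_conf(text):
    """Turn each non-empty, non-comment 'Key=Value ...' line into a dict."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        entries.append(dict(field.split("=", 1) for field in line.split()))
    return entries


def mps_per_device(entries):
    """Map each device file to the MPS count declared for it."""
    return {e["File"]: int(e["Count"]) for e in entries if e["Name"] == "mps"}


def node_mps_total(gres):
    """Extract the mps count from a slurm.conf Gres= value (0 if absent)."""
    for item in gres.split(","):
        parts = item.split(":")
        if parts[0] == "mps":
            return int(parts[-1])
    return 0


per_device = mps_per_device(parse_gres_conf(GRES_CONF))
total = node_mps_total(NODE_GRES)

print(per_device)                         # mps count per device file
print(sum(per_device.values()) == total)  # do the shares match the node total?
```

With the configuration above, each of the three devices carries 200 and the sum matches the node total of 600, so the totals at least are consistent.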

Best regards,

Holger
