Hello, When I am running this command: `salloc --nodelist=gpu03 -p A4500_Features --gres=gpu:1` and then automatically ssh to the job, what should I see when I run nvidia-smi? All the GPUs in the host or just a single one? Thanks
Hello Oren,
On 8/5/24 3:20 PM, Oren via slurm-users wrote:
When I am running this command: `salloc --nodelist=gpu03 -p A4500_Features --gres=gpu:1` and then automatically ssh to the job, what should I see when I run nvidia-smi? All the GPUs in the host or just a single one?
That should depend on the ConstrainDevices parameter. [1] You can quickly verify this with:
$ scontrol show conf | grep Constr
1. https://slurm.schedmd.com/cgroup.conf.html#OPT_ConstrainDevices
Best,
Hi James, I am sort of the admin and trying to understand what the goal should be. Thanks Roberto, I'll have a look on ConstrainDevices https://slurm.schedmd.com/cgroup.conf.html#OPT_ConstrainDevices
On Mon, 5 Aug 2024 at 18:14, Roberto Polverelli Monti via slurm-users < slurm-users@lists.schedmd.com> wrote:
Hello Oren,
On 8/5/24 3:20 PM, Oren via slurm-users wrote:
When I am running this command: `salloc --nodelist=gpu03 -p A4500_Features --gres=gpu:1` and then automatically ssh to the job, what should I see when I run nvidia-smi? All the GPUs in the host or just a single one?
That should depend on the ConstrainDevices parameter. [1] You can quickly verify this with:
$ scontrol show conf | grep Constr
Best,
-- Roberto Polverelli Monti HPC Engineer Do IT Now | https://doit-now.tech/
-- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-leave@lists.schedmd.com