[slurm-users] Distribute the node resources in multiple partitions and regarding job submission script

Purvesh Parmar purveshp0507 at gmail.com
Sun Apr 10 11:57:38 UTC 2022


Hello,

I am using Slurm 21.08.

Q1: I have 8 nodes, each with 2 GPUs, 128 cores and 512 GB of RAM. I want
to split each node's resources between 2 partitions, so that partition
"par1" gets the 2 GPUs, 64 cores and 256 GB of RAM of the node, and
partition "par2" gets the remaining 64 cores and the remaining 256 GB of
RAM of the same node.

par1 should be the default partition.

I have set MaxCPUsPerNode and listed every node in both par1 and par2.
However, when I submit a job with par2 as the partition and request a GPU
(gres:gpu), the job is still accepted and starts running, even though par2
is not supposed to have any GPUs.
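
A job script of the kind described above could look like the following
minimal sketch. The job name, the task count and the report_allocation
helper are illustrative assumptions, not taken from the actual submission.
SLURM_JOB_PARTITION and CUDA_VISIBLE_DEVICES are variables that Slurm
normally sets inside a job (the latter only when GPUs are allocated).

```shell
#!/bin/bash
#SBATCH --job-name=par2-gpu-test   # hypothetical job name
#SBATCH --partition=par2           # the partition that should have no GPUs
#SBATCH --gres=gpu:1               # request one GPU anyway
#SBATCH --ntasks=1                 # assumed single task

# Print the partition the job landed in and the GPUs it can see.
# Outside of a Slurm job both variables are normally unset, so the
# fallbacks "unset" and "none" are printed instead.
report_allocation() {
    echo "partition=${SLURM_JOB_PARTITION:-unset} gpus=${CUDA_VISIBLE_DEVICES:-none}"
}

report_allocation
```

If par2 really had no GPUs, a script like this submitted with sbatch
would not be expected to start; in the setup described here it is
accepted and runs.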

slurm.conf (something like this)

########################
NodeName=comp1,comp2......comp8 Sockets=1 CPUs=64 CoresPerSocket=64 ThreadsPerCore=1 Gres=gpu:2
PartitionName=par1 State=UP Nodes=comp1,comp2......comp8 MaxCPUsPerNode=64
PartitionName=par2 State=UP Nodes=comp1,comp2......comp8 MaxCPUsPerNode=64
########################

What am I doing wrong?

Q2: How can I save the job scripts permanently? I have set:
SlurmdSpoolDir=/usr/local/slurm/var/spool/slurmd
AccountingStorageEnforce=safe
AccountingStoreFlags=job_script,job_env

Regards,
Purvesh