[slurm-users] Serial jobs on multi-core nodes using whole compute node

Nicholas Yue yue.nicholas at gmail.com
Wed Jan 1 19:49:17 UTC 2020


Hi,

  I am new to SLURM

  I am experimenting with a small cluster, 1 head node (with GPU for
visualization, not for CUDA jobs yet) and 2 compute nodes

  Each machine have quad core processors but no hyperthreading.

  I have the following test job:

---------------------------------------------------------------------
#!/bin/bash
#SBATCH --job-name=producer               # Job name
#SBATCH --output=producer.%j.log          # Standard output and error log
#SBATCH --cpus-per-task=1

/mnt/nfs/projects/workflow-languages/build/producer output.$SLURM_JOB_ID.txt
---------------------------------------------------------------------

My slurm.conf is this, is the configuration wrong ?
---------------------------------------------------------------------
# slurm.conf file generated by configurator easy.html.
# Put this file on all nodes of your cluster.
# See the slurm.conf man page for more information.
#
ControlMachine=head0
ControlAddr=192.168.0.10
#
#MailProg=/bin/mail
MpiDefault=none
#MpiParams=ports=#-#
ProctrackType=proctrack/pgid
ReturnToService=1
SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid
#SlurmctldPort=6817
SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid
#SlurmdPort=6818
SlurmdSpoolDir=/var/lib/slurm-llnl/slurmd
SlurmUser=slurm
#SlurmdUser=root
StateSaveLocation=/var/lib/slurm-llnl/slurmctld
SwitchType=switch/none
TaskPlugin=task/none
#
#
# TIMERS
#KillWait=30
#MinJobAge=300
#SlurmctldTimeout=120
#SlurmdTimeout=300
#
#
# SCHEDULING
FastSchedule=1
SchedulerType=sched/backfill
#SchedulerPort=7321
SelectType=select/linear
#
#
# LOGGING AND ACCOUNTING
AccountingStorageType=accounting_storage/none
ClusterName=cluster
#JobAcctGatherFrequency=30
JobAcctGatherType=jobacct_gather/none
#SlurmctldDebug=3
SlurmctldLogFile=/var/log/slurm-llnl/slurmctld.log
#SlurmdDebug=3
SlurmdLogFile=/var/log/slurm-llnl/slurmd.log
#
#
# COMPUTE NODES
NodeName=head0 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=32768
State=UNKNOWN
NodeName=compute[0-1] CoresPerSocket=4 ThreadsPerCore=1 RealMemory=16384
State=UNKNOWN
PartitionName=cfd Nodes=head0,compute[0-1] Default=YES MaxTime=INFINITE
State=UP
---------------------------------------------------------------------

Cheers
-- 
Nicholas Yue
Graphics - Arnold, Alembic, RenderMan, OpenGL, HDF5
Custom Dev - C++ porting, OSX, Linux, Windows
http://au.linkedin.com/in/nicholasyue
https://vimeo.com/channels/naiadtools
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200101/fbbb2234/attachment.htm>


More information about the slurm-users mailing list