I guess I have the syntax wrong,

root@node1 slurm]# /usr/sbin/slurmd -D
slurmd: fatal: Unable to create NodeAddr list from node[1-7].ods.vuw.ac.nz
[root@node1 slurm]# tail /etc/slurm/slurm.conf
#ResumeRate=
#SuspendExcNodes=
#SuspendExcParts=
#SuspendRate=
#SuspendTime=
#
#
# COMPUTE NODES
NodeName=node[1-7].ods.vuw.ac.nz CPUs=20 RealMemory=48 State=UNKNOWN
PartitionName=debug Nodes=ALL Default=YES MaxTime=INFINITE State=UP
[root@node1 slurm]#


regards

Steven



From: Steven Jones via slurm-users <slurm-users@lists.schedmd.com>
Sent: Wednesday, 4 December 2024 1:28 pm
To: slurm-users@schedmd.com <slurm-users@schedmd.com>
Subject: [slurm-users] Re: Slurm not running on a warewulf node
 
Well that is a start, TY.

[root@node1 slurm]# /usr/sbin/slurmd -D
slurmd: fatal: Unable to determine this slurmd's NodeName

Where is this set?

regards

Steven 



From: Jeffrey R. Lang <JRLang@uwyo.edu>
Sent: Wednesday, 4 December 2024 1:17 pm
To: Steven Jones <steven.jones@vuw.ac.nz>; slurm-users@schedmd.com <slurm-users@schedmd.com>
Subject: RE: Slurm not running on a warewulf node
 
You don't often get email from jrlang@uwyo.edu. Learn why this is important

Steve

 

  Trying running the failing process from the command line and use the -D option.

 

Per man page: Run slurmd in the foreground. Error and debug messages will be copied to stderr.

 

Jeffrey R. Lang

Advanced Research Computing Center

University of Wyoming, Information Technology Center

1000 E. University Ave

Laramie,  WY 82071

 

Email: jrlang@uwyo.edu

Work: 307.766.3381

 

From: Steven Jones via slurm-users <slurm-users@lists.schedmd.com>
Sent: Tuesday, December 3, 2024 5:39 PM
To: slurm-users@schedmd.com
Subject: [slurm-users] slurm not running on a warewulf node

 

 This message was sent from a non-UWYO address. Please exercise caution when clicking links or opening attachments from external sources.

 

Hi,

I have set a log creation/location in slurm.conf   as,

SlurmdLogFile=/var/log/slurm/slurmd.log

But it is 0 length.

Slurm will not run, what else do I need to do to log why its failing pls?

 

 

regards

Steven