[slurm-users] New checktopology tool: Check consistency of /etc/slurm/topology.conf with nodelist in /etc/slurm/slurm.conf

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Mon Jan 21 10:10:09 UTC 2019


Hi Slurm users,

When removing dead nodes or adding new nodes, I've too often made the 
mistake of forgetting to update correctly the /etc/slurm/topology.conf file.

Therefore I wrote a simple "checktopology" tool to check the consistency 
of /etc/slurm/topology.conf with the nodelist in /etc/slurm/slurm.conf 
(as presented by the sinfo command).  It's available from my Github 
project at https://github.com/OleHolmNielsen/Slurm_tools/tree/master/nodes

An example output is:

$ checktopology
Extract nodelist from /etc/slurm/topology.conf
Differences between Slurm nodelist and nodes in /etc/slurm/topology.conf
*** /tmp/nodelist.12694	2019-01-21 10:07:01.468864295 +0100
--- /tmp/topologylist.12694	2019-01-21 10:07:01.365857415 +0100
***************
*** 368,373 ****
--- 368,378 ----
   c088
   c089
   c090
+ c091
+ c092
+ c093
+ c094
+ c095
   d001
   d002
   d003
***************
*** 595,600 ****
--- 600,606 ----
   i048
   i049
   i050
+ i051
   x001
   x002
   x003

Comments and suggestions are most welcome!

FYI: My Slurm Wiki contains available information about adding/removing 
nodes: https://wiki.fysik.dtu.dk/niflheim/SLURM#add-and-remove-nodes

-- 
Ole Holm Nielsen
Department of Physics, Technical University of Denmark



More information about the slurm-users mailing list