Hi John,
Nice to hear from you again!
On 8/18/25 13:00, John Hearns via slurm-users wrote:
I want to run a healtcheck job on all nodes. I can select the nodes in a partition by hand, the write a bash cript to get a list of nodes using nodeset -e Then submit to each node in the list using sbatch -w
Is there a cleaner way of doing this?
IMHO the cleanest way is to use the great ClusterShell tool[1], where Slurm partitions and nodes can be configured as shown in the Wiki examples. For example, to run NHC on all nodes:
$ clush -ba nhc
Best regards, Ole
[1] https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_operations/#clustershell