On Tue, 2025-08-19 at 10:20 -0400, Paul Edmon via slurm-users wrote:
We've been using NHC (https://github.com/mej/nhc) for years with much success. However that project hasn't had a release in 2 years and the various Issues filed indicate that there might be problems with Rocky 9 (which we are looking to upgrade to). Do people that are at EL9 use NHC? Is there a fork? Is there a different code that people use for doing node health checks?
-Paul Edmon-
I guess that checking for open ports with nmap should be sufficient to tell that the daemons are up and responding:
nmap -p <port> <ip-address range>
VB