[slurm-users] fast way for a node to determine its own state?

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Wed Mar 21 04:35:47 MDT 2018


On 03/21/2018 11:18 AM, Alexis Huxley wrote:
> I'm running a node health script that needs to know the state of
> the node on which it is running. Currently, I'm getting the
> state with this:
> 
> 	sinfo -N ... | grep `uname -n`
> 
> Depending on the load on the scheduler, this can be slow. Is there
> faster way? Perhaps one that doesn't involve communicating with
> the scheduler node? Thanks!

How about using Node Health Check (NHC) from https://github.com/mej/nhc 
?  I've written some practical info about NHC in the Wiki page 
https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#node-health-check

/Ole



More information about the slurm-users mailing list