[slurm-users] Updated Slurm tool "pestat" (Processor Element status)

Ryan Novosielski novosirj at rutgers.edu
Wed Nov 21 07:23:02 MST 2018


Thanks Olm! I am quite fond of your utilities — thank you for providing them. 

Sent from my iPhone

> On Nov 21, 2018, at 08:51, Ole Holm Nielsen <Ole.H.Nielsen at fysik.dtu.dk> wrote:
> 
> Dear Slurm users,
> 
> The Slurm tool "pestat" (Processor Element status) has been enhanced due to a user request.  Now pestat will display an additional available GRES column for the nodes if the -G flag is used.  This is useful if your nodes have GPUs installed.
> 
> The pestat tool prints a Slurm cluster nodes status with 1 line per node and job info.  Download pestat from https://github.com/OleHolmNielsen/Slurm_tools/tree/master/pestat
> 
> The pestat tool has many options:
> 
> # pestat -h
> Usage: pestat [-p partition(s)] [-u username] [-g groupname]
>    [-q qoslist] [-s statelist] [-n/-w hostlist] [-j joblist] [-G] [-N]
>    [-f | -F | -m free_mem | -M free_mem ] [-1|-2] [-d] [-E] [-C|-c] [-V] [-h]
> where:
>    -p partition: Select only partion <partition>
>    -u username: Print only user <username>
>    -g groupname: Print only users in UNIX group <groupname>
>    -q qoslist: Print only QOS in the qoslist <qoslist>
>    -R reservationlist: Print only node reservations <reservationlist>
>    -s statelist: Print only nodes with state in <statelist>
>    -n/-w hostlist: Print only nodes in hostlist
>    -j joblist: Print only nodes in job <joblist>
>    -G: Print GRES (Generic Resources) in addition to JobId
>    -N: Print JobName in addition to JobId
>    -f: Print only nodes that are flagged by * (unexpected load etc.)
>    -F: Like -f, but only nodes flagged in RED are printed.
>    -m free_mem: Print only nodes with free memory LESS than free_mem MB
>    -M free_mem: Print only nodes with free memory GREATER than free_mem MB (under-utilized)
>    -d: Omit nodes with states: down drained
>    -1: Default: Only 1 line per node (unique nodes in multiple partitions are printed once only)
>    -2: 2..N lines per node which participates in multiple partitions
>    -E: Job EndTime is printed after each jobid/user
>    -C: Color output is forced ON
>    -c: Color output is forced OFF
>    -h: Print this help information
>    -V: Version information
> 
> My monitoring of jobs is usually done simply with "pestat -F", and also with "pestat -s mix".
> 
> /Ole
> 
> 
> -- 
> Ole Holm Nielsen
> PhD, Senior HPC Officer
> Department of Physics, Technical University of Denmark
> 


More information about the slurm-users mailing list