[slurm-users] ANNOUNCE: A new showuserlimits tool for printing Slurm user resource limits and usage
Ole Holm Nielsen
Ole.H.Nielsen at fysik.dtu.dk
Wed Aug 21 09:16:33 UTC 2019
Dear Slurm users,
It is very useful to view Slurm a user's resource limits and current
usage. For example, jobs may be blocked because some resource limit gets
exceeded, and it is important to analyze why this occurs.
Several Slurm commands such as sshare and sacctmgr can print a number of
user limits, and to a lesser extent the user's current usage, however,
their capabilities are very limited.
The showuserlimits tool fills this need by inquiring the Slurm database
about all available user and association limits and current usages. The
amount of information in the database is quite extensive, so the
showuserlimits tool allows filtering the data and print only the desired
information. An output example is:
$ showuserlimits -u xxx -l GrpTRESRunMins -s cpu
Association (User):
ClusterName = niflheim
Account = camdvip
UserName = xxx, current value or id = 1777
Partition = None, current value or id = Any partition
GrpTRESRunMins =
cpu: Limit = 7000000, current value = 2800752
The showuserlimits tool can be downloaded from:
https://github.com/OleHolmNielsen/Slurm_tools/tree/master/showuserlimits
The showuserlimits tool is used by the showjob command available from
the scripts for managing jobs:
https://github.com/OleHolmNielsen/Slurm_tools/tree/master/jobs
If you have comments or suggestions regarding these tools, please send
me a mail.
Best regards,
Ole
--
Ole Holm Nielsen
PhD, Senior HPC Officer
Department of Physics, Technical University of Denmark
More information about the slurm-users
mailing list