[slurm-users] [External] Sinfo or squeue stuck for some seconds
fzillner at lenovo.com
Mon Aug 30 12:47:19 UTC 2021
Could it be that you're using LDAP/AD/NIS for user management? If so, check whether the LDAP server's response is slow, or slows down when retrieving hundreds or thousands of users.
Also, CacheGroups=1 was last supported in version 15.08.
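One rough way to check the lookup speed is to time a full user and group enumeration on the slurmctld host. The sketch below is only an illustration using Python's standard pwd and grp modules; the helper names time_lookup and report are made up for this example. It assumes the directory service allows enumeration (with SSSD, for instance, enumeration is often disabled, in which case only local entries come back and the timing says little about LDAP).

```python
import grp
import pwd
import time


def time_lookup(enumerate_fn):
    """Call an enumeration function and return (seconds elapsed, entry count)."""
    start = time.perf_counter()
    entries = enumerate_fn()
    elapsed = time.perf_counter() - start
    return elapsed, len(entries)


def report():
    # pwd.getpwall() and grp.getgrall() go through NSS, so they also hit
    # LDAP/AD/NIS when those sources are configured and enumerable.
    for name, fn in (("passwd", pwd.getpwall), ("group", grp.getgrall)):
        elapsed, count = time_lookup(fn)
        print(f"{name}: {count} entries in {elapsed:.3f} s")


if __name__ == "__main__":
    report()
```

If either enumeration takes several seconds, that would be consistent with the roughly 10-second processing times in the warnings quoted below, though it does not prove the directory service is the cause.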
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of navin srivastava <navin.altair at gmail.com>
Sent: Sunday, 29 August 2021 16:53
To: Slurm User Community List <slurm-users at lists.schedmd.com>
Subject: [External] [slurm-users] Sinfo or squeue stuck for some seconds
Dear slurm community users,
We are using Slurm version 20.02.x.
We see the message below appearing many times in the slurmctld log, and whenever it appears the sinfo/squeue output becomes slow.
No timeouts occur, as I have kept the timeout value at 100.
Warning: Note very large processing time from load_part_uid_allow_list: usec=10800885 began=16:27:55.952
[2021-08-29T16:28:06.753] Warning: Note very large processing time from _slurmctld_background: usec=10801120 began=16:27:55.952
Is this a bug or a configuration issue? Has anybody faced a similar issue? Could anybody shed some light on this?
Please find the slurm.conf attached.