[slurm-users] sacct issue: jobs staying in "RUNNING" state
Chris Samuel
chris at csamuel.org
Wed Jul 17 04:26:16 UTC 2019
On 16/7/19 11:43 am, Will Dennis wrote:
> [2019-07-16T09:36:51.464] error: slurmdbd: agent queue is full (20140),
> discarding DBD_STEP_START:1442 request
So it looks like your slurmdbd cannot keep up with the rate of these
incoming steps and is having to throw away messages.
> [2019-07-16T09:40:27.515] error: slurmdbd: agent queue filling (20140),
> RESTART SLURMDBD NOW
Have you tried doing what it told you to?
You may want to look at the performance of you MySQL server to see if
it's failing to keep up with what slurmdbd is asking it to do.
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
More information about the slurm-users
mailing list