[slurm-users] sacct issue: jobs staying in "RUNNING" state

Chris Samuel chris at csamuel.org
Wed Jul 17 04:26:16 UTC 2019


On 16/7/19 11:43 am, Will Dennis wrote:

> [2019-07-16T09:36:51.464] error: slurmdbd: agent queue is full (20140), 
> discarding DBD_STEP_START:1442 request

So it looks like your slurmdbd cannot keep up with the rate of these 
incoming steps and is having to throw away messages.

> [2019-07-16T09:40:27.515] error: slurmdbd: agent queue filling (20140), 
> RESTART SLURMDBD NOW

Have you tried doing what it told you to?

You may want to look at the performance of you MySQL server to see if 
it's failing to keep up with what slurmdbd is asking it to do.

All the best,
Chris
-- 
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA



More information about the slurm-users mailing list