[slurm-users] Slurm overhead
mahmood.nt at gmail.com
Thu Apr 26 02:08:06 MDT 2018
It seems that the number of threads has some effects on the
performance. Maybe some configurations issue exists in openmpi. I will
investigate more on that. Thanks guys for the tips.
On Tue, Apr 24, 2018 at 9:18 PM, Ryan Novosielski <novosirj at rutgers.edu> wrote:
> I would likely crank up the debugging on the slurmd process and look at the log files to see what’s going on in that time. You could also watch the job via top or other means (on Linux, you can press “1” to see line-by-line for each CPU core), or use strace on the process itself. Presumably something is happening that’s either eating up 4 minutes, or the job is running 4 minutes more slowly and you’ll need to figure out why. I know that our jobs run via the scheduler perform about on par for the hardware, and that jobs start fairly immediately.
More information about the slurm-users