[slurm-users] [Slurm 18.08.4] sacct/seff Inaccurate usercpu on Job Arrays

Paddy Doyle paddy at tchpc.tcd.ie
Wed Jan 9 07:19:16 MST 2019


On Wed, Jan 09, 2019 at 12:44:03PM +0100, Bj?rn-Helge Mevik wrote:

> Paddy Doyle <paddy at tchpc.tcd.ie> writes:
> 
> > Looking back through the mailing list, it seems that from 2015 onwards the
> > recommendation from Danny was to use 'jobacct_gather/linux' instead of
> > 'jobacct_gather/cgroup'. I didn't pick up on that properly, so we kept with
> > the cgroup version.
> >
> > Is anyone else still using jobacct_gather/cgroup and are you seeing this
> > same issue?
> 
> Just a side note: In last year's SLUG, Tim recommended the following
> settings:
> 
> proctrack/cgroup, task/cgroup, jobacct_gather/cgroup
> 
> So the recommendation for jobacct_gather might have changed -- or Danny
> and Tim might just have different opinions. :)

Interesting... the cgroups documentation page still says the performance of
jobacct_gather/cgroup is worse than jobacct_gather/linux. Although
according to the git commits of doc/html/cgroups.shtml, that was added to
the page in Jan 2015, so yeah maybe things have changed again. :)

https://slurm.schedmd.com/cgroups.html

In that case, either set 'JobAcctGatherFrequency=task=0' or wait for the
bug to be fixed.

Paddy

-- 
Paddy Doyle
Trinity Centre for High Performance Computing,
Lloyd Building, Trinity College Dublin, Dublin 2, Ireland.
Phone: +353-1-896-3725
http://www.tchpc.tcd.ie/



More information about the slurm-users mailing list