[slurm-users] [Slurm 18.08.4] sacct/seff Inaccurate usercpu values

Henkel, Andreas henkel at uni-mainz.de
Fri Jan 18 07:48:33 UTC 2019


Thank you, Mike. I hadn’t seen that yet.

> On 16.01.2019 at 16:57, Michael Robbert <mrobbert at mines.edu> wrote:
> 
> Andreas,
> 
> Look again. I just looked and a commit to the source code was posted to 
> the bug yesterday afternoon. It looks like that patch applies to the 
> cgroup plugin. It won't show up until the next release, but at least 
> there is a fix available.
> 
> Mike Robbert
> 
>> On 1/15/19 11:43 PM, Henkel, Andreas wrote:
>> Bad news for the cgroup users: it seems the bug was "resolved" by the site switching to task/Linux instead :-(
>> 
>>> On 09.01.2019 at 22:06, Christopher Benjamin Coffey <Chris.Coffey at nau.edu> wrote:
>>> 
>>> Thanks... looks like the bug should get some attention now that a paying site is complaining:
>>> 
>>> https://bugs.schedmd.com/show_bug.cgi?id=6332
>>> 
>>> Thanks Jurij!
>>> 
>>> Best,
>>> Chris
>>> 
>>> Christopher Coffey
>>> High-Performance Computing
>>> Northern Arizona University
>>> 928-523-1167
>>> 
>>> 
>>> On 1/9/19, 7:24 AM, "slurm-users on behalf of Paddy Doyle" <slurm-users-bounces at lists.schedmd.com on behalf of paddy at tchpc.tcd.ie> wrote:
>>> 
>>>>    On Wed, Jan 09, 2019 at 12:44:03PM +0100, Bjørn-Helge Mevik wrote:
>>>> 
>>>> Paddy Doyle <paddy at tchpc.tcd.ie> writes:
>>>> 
>>>>> Looking back through the mailing list, it seems that from 2015 onwards the
>>>>> recommendation from Danny was to use 'jobacct_gather/linux' instead of
>>>>> 'jobacct_gather/cgroup'. I didn't pick up on that properly, so we kept with
>>>>> the cgroup version.
>>>>> 
>>>>> Is anyone else still using jobacct_gather/cgroup and are you seeing this
>>>>> same issue?
>>>> Just a side note: In last year's SLUG, Tim recommended the following
>>>> settings:
>>>> 
>>>> proctrack/cgroup, task/cgroup, jobacct_gather/cgroup
>>>> 
>>>> So the recommendation for jobacct_gather might have changed -- or Danny
>>>> and Tim might just have different opinions. :)
>>>    Interesting... the cgroups documentation page still says the performance of
>>>    jobacct_gather/cgroup is worse than jobacct_gather/linux. Although
>>>    according to the git commits of doc/html/cgroups.shtml, that was added to
>>>    the page in Jan 2015, so yeah maybe things have changed again. :)
>>> 
>>>    https://slurm.schedmd.com/cgroups.html
>>> 
>>>    In that case, either set 'JobAcctGatherFrequency=task=0' or wait for the
>>>    bug to be fixed.
>>> 
>>>    Paddy
>>> 
>>>    --
>>>    Paddy Doyle
>>>    Trinity Centre for High Performance Computing,
>>>    Lloyd Building, Trinity College Dublin, Dublin 2, Ireland.
>>>    Phone: +353-1-896-3725
>>>    http://www.tchpc.tcd.ie/
>>> 
>>> 
>>> 
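
For reference, the settings discussed in this thread could be written in
slurm.conf roughly as follows. This is only a sketch assembled from the plugin
names and the workaround quoted above, not a configuration recommended by any
of the posters. The "Option A / Option B" labels are illustrative, and only
one JobAcctGatherType line should be active at a time.

```conf
# Process tracking and task containment via cgroups
# (the proctrack/task settings cited from SLUG in the thread)
ProctrackType=proctrack/cgroup
TaskPlugin=task/cgroup

# Option A: cgroup-based accounting, the plugin the posted patch applies to
JobAcctGatherType=jobacct_gather/cgroup
# Workaround suggested in the thread until a release with the fix is installed
JobAcctGatherFrequency=task=0

# Option B: Linux-based accounting, the older recommendation mentioned above
#JobAcctGatherType=jobacct_gather/linux
```

As far as I understand it, task=0 turns off periodic task sampling, so usage
is gathered only when the job ends. Please check the slurm.conf man page for
your Slurm version before relying on this.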

