[slurm-users] [Slurm 18.08.4] sacct/seff Inaccurate usercpu values

Michael Robbert mrobbert at mines.edu
Wed Jan 16 15:37:03 UTC 2019


Andreas,

Look again. I just looked and a commit to the source code was posted to 
the bug yesterday afternoon. It looks like that patch applies to the 
cgroup plugin. It won't show up until the next release, but at least 
there is a fix available.

Mike Robbert

On 1/15/19 11:43 PM, Henkel, Andreas wrote:
> Bad news Dir the cgroup-Users, seems like the bug is „resolved“ by the site switching to task/Linux instead :-(
>
>> Am 09.01.2019 um 22:06 schrieb Christopher Benjamin Coffey <Chris.Coffey at nau.edu>:
>>
>> Thanks... looks like the bug should get some attention now that a paying site is complaining:
>>
>> https://bugs.schedmd.com/show_bug.cgi?id=6332
>>
>> Thanks Jurij!
>>
>> Best,
>> Chris
>>
>>>> Christopher Coffey
>> High-Performance Computing
>> Northern Arizona University
>> 928-523-1167
>>
>>
>> On 1/9/19, 7:24 AM, "slurm-users on behalf of Paddy Doyle" <slurm-users-bounces at lists.schedmd.com on behalf of paddy at tchpc.tcd.ie> wrote:
>>
>>>     On Wed, Jan 09, 2019 at 12:44:03PM +0100, Bj?rn-Helge Mevik wrote:
>>>
>>> Paddy Doyle <paddy at tchpc.tcd.ie> writes:
>>>
>>>> Looking back through the mailing list, it seems that from 2015 onwards the
>>>> recommendation from Danny was to use 'jobacct_gather/linux' instead of
>>>> 'jobacct_gather/cgroup'. I didn't pick up on that properly, so we kept with
>>>> the cgroup version.
>>>>
>>>> Is anyone else still using jobacct_gather/cgroup and are you seeing this
>>>> same issue?
>>> Just a side note: In last year's SLUG, Tim recommended the following
>>> settings:
>>>
>>> proctrack/cgroup, task/cgroup, jobacct_gather/cgroup
>>>
>>> So the recommendation for jobacct_gather might have changed -- or Danny
>>> and Tim might just have different opinions. :)
>>     Interesting... the cgroups documentation page still says the performance of
>>     jobacct_gather/cgroup is worse than jobacct_gather/linux. Although
>>     according to the git commits of doc/html/cgroups.shtml, that was added to
>>     the page in Jan 2015, so yeah maybe things have changed again. :)
>>
>>     https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fslurm.schedmd.com%2Fcgroups.html&data=02%7C01%7Cchris.coffey%40nau.edu%7C2e47d9c9330646a8245f08d6763e2346%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C636826406595983378&sdata=i634oCV0NeO6DvBos05gM3iF7YxI%2FJC%2BZC7MJ222SW8%3D&reserved=0
>>
>>     In that case, either set 'JobAcctGatherFrequency=task=0' or wait for the
>>     bug to be fixed.
>>
>>     Paddy
>>
>>     --
>>     Paddy Doyle
>>     Trinity Centre for High Performance Computing,
>>     Lloyd Building, Trinity College Dublin, Dublin 2, Ireland.
>>     Phone: +353-1-896-3725
>>     https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.tchpc.tcd.ie%2F&data=02%7C01%7Cchris.coffey%40nau.edu%7C2e47d9c9330646a8245f08d6763e2346%7C27d49e9f89e14aa099a3d35b57b2ba03%7C0%7C0%7C636826406595983378&sdata=S2PCubxVUifigrvyEnmFdrQb5G9Ak4roM2FJtUxiM%2Fw%3D&reserved=0
>>
>>
>>


More information about the slurm-users mailing list