<div dir="ltr">For the record, the issue seems to have been caused by a low CPU weight in TRESBillingWeights that was applied to different partitions. Removing the weight or increasing its value made the accounting work again for all users.<br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, 26 Aug 2020 at 17:54, Stephan Schott (&lt;<a href="mailto:schottve@hhu.de">schottve@hhu.de</a>&gt;) wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div dir="ltr">Still stuck with this; maybe this gives someone an idea. I tried resetting the RawUsage by forcing Slurm to regenerate assoc_usage, and although the file was generated, the RawUsage for all users is now stuck at 0. This makes me think there is a communication problem with slurmdbd (which, by the way, still reports things correctly through sreport). I tried changing the IP address as suggested in this related problem (<a href="https://lists.schedmd.com/pipermail/slurm-users/2020-March/005051.html" target="_blank">https://lists.schedmd.com/pipermail/slurm-users/2020-March/005051.html</a>), but it made no difference. Both slurmctld and slurmdbd have been restarted. Any ideas?</div></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, 20 Aug 2020 at 10:36, Stephan Schott (&lt;<a href="mailto:schottve@hhu.de" target="_blank">schottve@hhu.de</a>&gt;) wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Hi fellow Slurm users,<div>We are facing the following issue with Slurm 18.08 on Ubuntu Bionic (a Qlustar cluster). For the last 2+ months, one of our users has been using the queue intensively, submitting array jobs to handle his work. The problem is that for some reason his RawUsage hasn't increased (and is in fact close to 0), and hence his Fairshare factor is mistakenly high. 
The curious thing is that the usage reported by sreport fits quite well with what we have seen in the last weeks. </div><div>What can cause this kind of discrepancy? All users are configured in the same way and use more or less the same partitions. The only difference I saw was the use of array jobs instead of normal batch jobs, but I have no idea why that would cause differences; we are now running some tests to check whether that is actually the case.</div><div>Any ideas are welcome,<br clear="all"><div><br></div>-- <br><div dir="ltr"><div dir="ltr"><div style="font-size:12.8px">Stephan Schott Verdugo<br></div><span style="font-size:12.8px">Biochemist</span><br style="font-size:12.8px"><div style="font-size:12.8px"><br>Heinrich-Heine-Universitaet Duesseldorf<br>Institut fuer Pharm. und Med. Chemie<br>Universitaetsstr. 1<br>40225 Duesseldorf<br>Germany</div></div></div></div></div>
</blockquote></div>
</blockquote></div>
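<div dir="ltr">For anyone finding this thread later, here is a rough sketch of why a low CPU weight in TRESBillingWeights can leave RawUsage close to 0. It assumes only what the Slurm documentation states: the billable TRES of a job is the sum of its allocated TRES multiplied by their billing weights, and RawUsage is counted in TRES-seconds. The function names, weights and job sizes below are hypothetical, and decay, usage factors and any rounding inside Slurm are ignored, so this is an illustration and not Slurm's actual code.</div>

```python
def billable_tres(alloc, weights):
    """Sum of allocated TRES counts multiplied by their billing weights.

    `alloc` maps a TRES name to the amount allocated to the job and
    `weights` maps a TRES name to its billing weight. Both dictionaries
    are hypothetical stand-ins for a job allocation and a partition's
    TRESBillingWeights setting.
    """
    return sum(weight * alloc.get(name, 0) for name, weight in weights.items())


def tres_seconds(billing, elapsed_seconds):
    """Usage charged for a job, ignoring decay and any usage factor."""
    return billing * elapsed_seconds


if __name__ == "__main__":
    job = {"CPU": 16}   # hypothetical job: 16 CPUs allocated
    elapsed = 3600      # hypothetical runtime: one hour

    # Compare a weight of 1.0 with a much lower, made-up weight.
    for cpu_weight in (1.0, 0.01):
        billing = billable_tres(job, {"CPU": cpu_weight})
        usage = tres_seconds(billing, elapsed)
        print(f"CPU weight {cpu_weight}: billing={billing}, usage={usage}")
```

<div dir="ltr">With the lower weight the same job is charged a hundred times less usage, which is consistent with a RawUsage that barely moves while sreport, which reports the allocated resources, still looks correct.</div>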