[slurm-users] Way MaxRSS should be interpreted

E.S. Rosenberg esr+slurm-dev at mail.hebrew.edu
Tue Apr 17 04:37:04 MDT 2018


Hi fellow slurm users,
We have been struggling for a while with understanding how MaxRSS is
reported.

This because jobs often die with MaxRSS not even approaching 10% of the
requested memory sometimes.

I just found the following document:
https://research.csc.fi/-/a

It says:
"*maxrss *= maximum amount of memory used at any time by any process in
that job. This applies directly for serial jobs. For parallel jobs you need
to multiply with the number of cores (max 16 or 24 as this is reported only
for that node that used the most memory)"

While 'man sacct' says:
"Maximum resident set size of all tasks in job."

Which explanation is correct? How should I be interpreting MaxRSS?

Thanks,
Eli
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180417/752e99f4/attachment-0001.html>


More information about the slurm-users mailing list