[slurm-users] virtual memory limit exceeded

Chris Samuel chris at csamuel.org
Fri Nov 9 04:51:01 MST 2018


On Friday, 9 November 2018 2:16:48 PM AEDT Noam Bernstein wrote:

> Can anyone shed some light on where the _virtual_ memory limit comes from? 
>
> We're getting jobs killed with the message
> slurmstepd: error: Step 3664.0 exceeded virtual memory limit (79348101120 > 72638634393), being killed
>
> Is this a limit that's dictated by cgroup.conf

It's not cgroups, that is enforced by the kernel instead, whereas this
is Slurm monitoring jobs and deciding it's used too much memory
and it needs to kill it.

All the best,
Chris
-- 
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC






More information about the slurm-users mailing list