[slurm-users] EXTERNAL: Re: Memory per CPU
Ryan Novosielski
novosirj at rutgers.edu
Wed Sep 30 14:00:46 UTC 2020
Primary one I’m aware of is that resource use is better reported (or at all in some cases) via srun, and srun can take care of MPI for an MPI job. I’m sure there are others as well (I guess avoiding another place where you have to describe the resources to be used and making sure they match, in the case of mpirun, etc.).
--
____
|| \\UTGERS, |---------------------------*O*---------------------------
||_// the State | Ryan Novosielski - novosirj at rutgers.edu<mailto:novosirj at rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\ of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
On Sep 30, 2020, at 09:38, Luecht, Jeff A <jeff.luecht at pnc.com> wrote:
First off, I want to thank everyone for their input and suggestions. They were very helpful an ultimately pointed me in the right direction. I spent several hours playing around with various settings.
Some additional background. When the srun command is used to execute this job, we do not see this issue. We only see it in SBATCH.
What I ultimate did was the following:
1 - Change the NodeName to add the specific parameters Sockets, Cores and Threads.
2 - Changed the DefMemPerCPU/MaxMemCPU to 16144/12228 instead of 6000/12000 respectively
I tested jobs after the above changes and used 'scontrol --defaults job <ID>' command. The CPU allocation now works as expected.
I do have one question though - what is the benefit/recommendation of using srun to execute a process within SBATCH. We are running primarily python jobs, but need to also support R jobs.
-----Original Message-----
From: slurm-users [mailto:slurm-users-bounces at lists.schedmd.com] On Behalf Of Diego Zuccato
Sent: Wednesday, September 30, 2020 2:18 AM
To: Slurm User Community List <slurm-users at lists.schedmd.com>; Michael Di Domenico <mdidomenico4 at gmail.com>
Subject: EXTERNAL: Re: [slurm-users] Memory per CPU
** This email has been received from outside the organization – Think before clicking on links, opening attachments, or responding. **
Il 29/09/20 16:19, Michael Di Domenico ha scritto:
what leads you to believe that you're getting 2 CPU's instead of 1?
I think I saw that too, once, but thought it was related to hyperthreading.
--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786
The contents of this email are the property of PNC. If it was not addressed to you, you have no legal right to read it. If you think you received it in error, please notify the sender. Do not forward or copy without permission of the sender. This message may be considered a commercial electronic message under Canadian law or this message may contain an advertisement of a product or service and thus may constitute a commercial electronic mail message under US law. You may unsubscribe at any time from receiving commercial electronic messages from PNC at http://pages.e.pnc.com/globalunsub/
PNC, 249 Fifth Avenue, Pittsburgh, PA 15222; pnc.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200930/30f846ee/attachment-0001.htm>
More information about the slurm-users
mailing list