[slurm-users] [EXT] Re: How to find core count per job per node
Tom Wurgler
twurgl at goodyear.com
Mon Oct 21 19:14:45 UTC 2019
Well, not really what I needed after all.
I have 24 core nodes. I submit a 36 way job then do a pestat :
pestat -j job1
Shows node1: 24
node2: 12
Now submit another 36 way job. It uses the other half of node2 and then also a node3
so pestat -j job1
node1: 24
node2: 24
and pestat -j job2
node2: 24
node3: 24
I'd like it to say:
pestat -j job1
node1: 24
node2: 12
and pestat -j job2
node2: 12
node3: 24
Does that make sense?
Thanks for any info.
tom
________________________________
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Ole Holm Nielsen <Ole.H.Nielsen at fysik.dtu.dk>
Sent: Friday, October 18, 2019 2:15 PM
To: slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
Subject: [EXT] Re: [slurm-users] How to find core count per job per node
WARNING: This is an EXTERNAL email. Please think before RESPONDING or CLICKING on links/attachments.
On 18-10-2019 19:56, Tom Wurgler wrote:
> I need to know how many cores a given job is using per node.
> Say my nodes have 24 cores each and I run a 36 way job.
> It take a node and a half.
> scontrol show job id
> shows me 36 cores, and the 2 nodes it is running on.
> But I want to know how it split the job up between the nodes.
The "pestat" tool can tell you the CPUload of nodes belonging to a job:
pestat -j jobid
Get pestat from
https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FOleHolmNielsen%2FSlurm_tools%2Ftree%2Fmaster%2Fpestat&data=01%7C01%7Ctwurgl%40goodyear.com%7C331258e9a6114731131c08d753f75e7b%7C939e896692854a9a9f040887efe8aae0%7C0&sdata=P%2BZUZsyZrjyGSQq52IzYZQL6g4JSJ8FAF1vnc8gHgQI%3D&reserved=0
The "psjob" tool prints the processes on nodes of a given job when
executed on the control node:
psjob jobid
get psjob and other tools from
https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FOleHolmNielsen%2FSlurm_tools%2Ftree%2Fmaster%2Fjobs&data=01%7C01%7Ctwurgl%40goodyear.com%7C331258e9a6114731131c08d753f75e7b%7C939e896692854a9a9f040887efe8aae0%7C0&sdata=1ejEnGWdiLUY9csk%2FljtAoGJl3KkKNKnz%2BoSVqKkQ3c%3D&reserved=0
/Ole
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20191021/a40b76cb/attachment.htm>
More information about the slurm-users
mailing list