[slurm-users] [EXT] Re: How to find core count per job per node

Ole Holm Nielsen Ole.H.Nielsen at fysik.dtu.dk
Tue Oct 22 05:42:04 UTC 2019


Hi Tom,

I think that "pestat -j jobid" gives you the information you are asking 
for.  If not, please copy your exact output and explain why this isn't 
what you need.

Thanks,
Ole


On 21-10-2019 21:14, Tom Wurgler wrote:
> Well, not really what I needed after all.
> 
> I have 24 core nodes. I submit a 36 way job then do a pestat :
> 
> pestat -j job1
> 
> Shows node1:  24
>              node2:  12
> 
> Now submit another 36 way job.  It uses the other half of node2 and then 
> also a node3
> 
> so pestat -j job1
>              node1: 24
>              node2: 24
> 
> and pestat -j job2
>              node2: 24
>              node3: 24
> 
> I'd like it to say:
> 
> pestat -j job1
>              node1: 24
>              node2: 12
> 
> and pestat -j job2
>              node2: 12
>              node3: 24
> 
> Does that make sense?
> 
> Thanks for any info.
> 
> tom
> 
> ------------------------------------------------------------------------
> *From:* slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of 
> Ole Holm Nielsen <Ole.H.Nielsen at fysik.dtu.dk>
> *Sent:* Friday, October 18, 2019 2:15 PM
> *To:* slurm-users at lists.schedmd.com <slurm-users at lists.schedmd.com>
> *Subject:* [EXT] Re: [slurm-users] How to find core count per job per node
> WARNING: This is an EXTERNAL email. Please think before RESPONDING or 
> CLICKING on links/attachments.
> 
> 
> 
> On 18-10-2019 19:56, Tom Wurgler wrote:
>> I need to know how many cores a given job is using per node.
>> Say my nodes have 24 cores each and I run a 36 way job.
>> It take a node and a half.
>> scontrol show job id
>> shows me 36 cores, and the 2 nodes it is running on.
>> But I want to know how it split the job up between the nodes.
> 
> The "pestat" tool can tell you the CPUload of nodes belonging to a job:
> 
> pestat -j jobid
> 
> Get pestat from
> https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FOleHolmNielsen%2FSlurm_tools%2Ftree%2Fmaster%2Fpestat&data=01%7C01%7Ctwurgl%40goodyear.com%7C331258e9a6114731131c08d753f75e7b%7C939e896692854a9a9f040887efe8aae0%7C0&sdata=P%2BZUZsyZrjyGSQq52IzYZQL6g4JSJ8FAF1vnc8gHgQI%3D&reserved=0
> 
> The "psjob" tool prints the processes on nodes of a given job when
> executed on the control node:
> 
> psjob jobid
> 
> get psjob and other tools from
> https://nam05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FOleHolmNielsen%2FSlurm_tools%2Ftree%2Fmaster%2Fjobs&data=01%7C01%7Ctwurgl%40goodyear.com%7C331258e9a6114731131c08d753f75e7b%7C939e896692854a9a9f040887efe8aae0%7C0&sdata=1ejEnGWdiLUY9csk%2FljtAoGJl3KkKNKnz%2BoSVqKkQ3c%3D&reserved=0



More information about the slurm-users mailing list