[slurm-users] Performance tracking of array tasks
Loris Bennett
loris.bennett at fu-berlin.de
Mon May 16 14:04:23 UTC 2022
Hi William,
William Dear <william.dear at i3-corps.com> writes:
> Could anyone please recommend methods of tracking the performance of individual tasks in a task array job? I have installed XDMoD but it is focused solely on the Job level with no information about
> tasks.
>
> My users almost exclusively use task arrays to run embarrassingly parallel jobs. After the job is complete I would like to see run time and peak RAM usage per task so that we can correctly size the
> reservations for future jobs. It would also be very helpful to break this down by node so that I can identify poorly performing nodes.
>
> William Dear
I'm not sure what you mean by a 'task array job'. A job can have
multiple tasks within it - I don't think you will be able to get data on
such individual tasks very easily. However, a job array is just a sort
of convenient wrapper around a bunch of jobs. Each element of a job
array still has its own job ID, so you can extract job data the same way
you do for a non-array job.
Cheers,
Loris
--
Dr. Loris Bennett (Herr/Mr)
ZEDAT, Freie Universität Berlin Email loris.bennett at fu-berlin.de
More information about the slurm-users
mailing list