[slurm-users] [EXTERNAL] Re: Information about finished jobs

Greg Wickham greg.wickham at kaust.edu.sa
Mon Jun 14 08:49:57 UTC 2021


As others have commented, some information is lost when it is stored in the database.

To keep historically accurate data on the job run a script (refer to PrologSlurmctld in slurm.conf) that runs an "scontrol show -d job <jobid>" and drops it into a local file.

Using " PrologSlurmctld" is neat, as it is executed on the slurmctld host when the job is being launched. (interestingly the job state will be shown as "CONFIGURING").

Side note - using "-d " with scontrol will provide accurate allocation of resources on each node (specific CPUs, specific GPUs, and memory).

  -Greg

On 14/06/2021, 10:37, "slurm-users on behalf of Arthur Gilly" <slurm-users-bounces at lists.schedmd.com on behalf of arthur.gilly at helmholtz-muenchen.de> wrote:

    Hi all,

    A related question, on my setup, scontrol show job displays the standard
    output, standard error redirections as well as the wd, whereas this info is
    lost after completion when sacct is required. Is this something that's
    configurable so that this info is preserved with sacct?

    Cheers,

    A

    -------------------------------------------------------------
    Dr. Arthur Gilly
    Head of Analytics
    Institute of Translational Genomics
    Helmholtz-Centre Munich (HMGU)
    -------------------------------------------------------------

--



More information about the slurm-users mailing list