[slurm-users] [EXTERNAL] Re: Information about finished jobs
Greg Wickham
greg.wickham at kaust.edu.sa
Mon Jun 14 08:49:57 UTC 2021
As others have commented, some information is lost when it is stored in the database.
To keep historically accurate data on a job, run a script (refer to PrologSlurmctld in slurm.conf) that runs "scontrol show -d job <jobid>" and writes the output to a local file.
Using "PrologSlurmctld" is neat, as it is executed on the slurmctld host when the job is being launched. (Interestingly, the job state will be shown as "CONFIGURING" at that point.)
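A minimal sketch of such a script, written in Python purely for illustration (any executable should work as a PrologSlurmctld). It assumes that slurmctld exports SLURM_JOB_ID to the script, that scontrol is on the PATH, and that the archive directory (a hypothetical path) is writable by the SlurmUser. The function and variable names are my own.

```python
#!/usr/bin/env python3
"""Sketch of a PrologSlurmctld script that snapshots a job's details."""
import os
import subprocess
import sys
from pathlib import Path

# Hypothetical location; pick a directory writable by the SlurmUser.
ARCHIVE_DIR = Path("/var/spool/slurm/job_snapshots")


def snapshot_job(job_id, archive_dir=ARCHIVE_DIR, run=subprocess.run):
    """Save the output of `scontrol show -d job <job_id>` as <archive_dir>/<job_id>.txt."""
    result = run(
        ["scontrol", "show", "-d", "job", str(job_id)],
        capture_output=True,
        text=True,
        check=False,
    )
    archive_dir = Path(archive_dir)
    archive_dir.mkdir(parents=True, exist_ok=True)
    target = archive_dir / f"{job_id}.txt"
    target.write_text(result.stdout)
    return target


def main():
    # Assumption: slurmctld sets SLURM_JOB_ID in the prolog environment.
    job_id = os.environ.get("SLURM_JOB_ID")
    if not job_id:
        return
    try:
        snapshot_job(job_id)
    except Exception as exc:
        # Log and carry on; a missing snapshot should not affect the job.
        print(f"job snapshot failed: {exc}", file=sys.stderr)


if __name__ == "__main__":
    # Falling off the end of the script exits with status 0.
    main()
```

It would be enabled with a line such as PrologSlurmctld=/path/to/this/script in slurm.conf (the path is a placeholder). The script deliberately never fails: as far as I know, a non-zero exit from PrologSlurmctld causes the job to be requeued, which is too high a price for a missing snapshot.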
Side note: using "-d" with scontrol will provide the accurate allocation of resources on each node (specific CPUs, specific GPUs, and memory).
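Once a snapshot is on disk, the fields that sacct does not keep can be read back from it. The sketch below pulls out StdOut, StdErr and WorkDir, relying only on the Key=Value layout that scontrol prints. The function name is hypothetical, and values containing whitespace would be cut at the first space.

```python
import re

# Match the three fields of interest wherever they appear in the snapshot.
FIELD_RE = re.compile(r"(?:^|\s)(StdOut|StdErr|WorkDir)=(\S+)")


def extract_paths(snapshot_text):
    """Return a dict of the StdOut, StdErr and WorkDir values found in the text."""
    return {key: value for key, value in FIELD_RE.findall(snapshot_text)}
```

Calling extract_paths() on the contents of a saved snapshot file returns a dictionary keyed by field name; any field that is absent from the snapshot is simply missing from the result.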
-Greg
On 14/06/2021, 10:37, "slurm-users on behalf of Arthur Gilly" <slurm-users-bounces at lists.schedmd.com on behalf of arthur.gilly at helmholtz-muenchen.de> wrote:
Hi all,
A related question: on my setup, scontrol show job displays the standard
output and standard error redirections as well as the working directory,
whereas this info is lost after completion, when sacct is required. Is this
something that's configurable so that this info is preserved with sacct?
Cheers,
A
-------------------------------------------------------------
Dr. Arthur Gilly
Head of Analytics
Institute of Translational Genomics
Helmholtz-Centre Munich (HMGU)
-------------------------------------------------------------