[slurm-users] Show detailed information from a finished job
mercan
ahmet.mercan at uhem.itu.edu.tr
Thu Apr 23 09:43:45 UTC 2020
Sorry, I mistakenly cropped out the "mkdir" line shown below:
mkdir -p $JDIR
It should come right after the "JDIR=/okyanus/..." line.
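
To make the placement concrete, here is a minimal sketch of the directory logic with the mkdir step restored. The function names job_bucket and job_log_file and the LOG_BASE variable are hypothetical and not part of my real epilog; LOG_BASE only exists so the sketch can be tried outside the cluster. The guard for job IDs shorter than three digits is also an assumption, since the script quoted below does not handle that case.

```bash
#!/bin/bash
# Hypothetical override for testing; the default is the path from the script.
LOG_BASE="${LOG_BASE:-/okyanus/SLURM/log/jobs}"

# Print the bucket directory name for a job ID, e.g. 230988 -> 230XXX.
job_bucket() {
    local ji="${1%%_*}"            # drop any "_<task id>" suffix of an array task
    local keep=$(( ${#ji} - 3 ))   # number of leading digits kept before "XXX"
    (( keep < 0 )) && keep=0       # assumption: very short IDs all go to "XXX"
    echo "${ji:0:keep}XXX"
}

# Create the bucket directory if needed and print the job's log file path.
job_log_file() {
    local ji="${1%%_*}"
    local jdir="$LOG_BASE/$(job_bucket "$1")"
    mkdir -p "$jdir" || return 1   # the line that was cropped from the script
    echo "$jdir/$ji.txt"
}
```

In the epilog this could be used roughly as OUT=$(job_log_file "$JOBNO") followed by scontrol show job -dd "$JOBNO" &>>"$OUT", so the directory always exists before the first write.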
Regards;
Ahmet M.
On 23.04.2020 at 12:31, mercan wrote:
> Hi;
>
> I prefer to use an epilog script that stores the job information under a
> top-level directory owned by the slurm user. To avoid one directory with
> a huge number of files, it creates a sub-directory for every thousand job
> files. For a job whose job ID is 230988, it creates a directory named
> 230XXX. The SLURM_JOB_ID of a job array is also a problem, because Slurm
> uses an ugly format (298903_[3%1]). For these reasons my script is a
> little complex, but it works (I cropped out the other, non-relevant
> parts):
>
> #!/bin/bash
>
> # For array tasks, identify the job as <array job id>_<task id>
> if [ "x$SLURM_ARRAY_JOB_ID" != "x" ]
> then
>     JOBNO="${SLURM_ARRAY_JOB_ID}_${SLURM_ARRAY_TASK_ID}"
> else
>     JOBNO="${SLURM_JOB_ID}"
> fi
> # Strip everything from the first underscore on, leaving the numeric job ID
> JI=${JOBNO//_*/}
> JWIDE=${#JI}
> # Length of the job ID without its last three digits
> JIDLEN=$((JWIDE-3))
> JDIR=/okyanus/SLURM/log/jobs/${JI:0:$JIDLEN}XXX
> # Append a separator, the job details, another separator, then the batch script
> echo "===========================================================================" &>>"$JDIR/${JI}.txt"
> scontrol show job -dd "$JOBNO" &>>"$JDIR/${JI}.txt" \
>     && echo "===========================================================================" >>"$JDIR/${JI}.txt" \
>     && scontrol write batch_script "$SLURM_JOBID" - >>"$JDIR/${JI}.txt"
> exit 0
>
> Regards;
>
> Ahmet M.
>
>
> On 23.04.2020 at 10:33, Gestió Servidors wrote:
>>
>> Hello,
>>
>> When a job is “pending” or “running”, I can get some useful
>> information with “scontrol show jobid=#jobnumber”, but once the job
>> has finished, that command doesn’t return anything. For example, if I
>> run “sacct” and see that some jobs have finished with state “FAILED”,
>> how can I get detailed information about those jobs?
>>
>> Thanks.
>>
>
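
For jobs where no epilog record was stored, the accounting database may still hold some details, assuming accounting storage is configured on the cluster. The sketch below wraps a sacct query in a hypothetical helper named finished_job_info. The field list is only an example selection, and "sacct --helpformat" prints every field that can be requested. It gives less detail than the "scontrol show job -dd" output.

```bash
#!/bin/bash
# Print accounting details for a finished job.
# The helper name and the chosen fields are illustrative assumptions.
finished_job_info() {
    local jobid="$1"
    sacct -j "$jobid" \
        --format=JobID,JobName,Partition,State,ExitCode,Start,End,Elapsed,NodeList
}

# Usage (needs a cluster with accounting enabled):
#   finished_job_info 230988
```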