[slurm-users] howto list/get all scripts run by a job?
Adrian Sevcenco
Adrian.Sevcenco at spacescience.ro
Fri Jun 19 09:39:56 UTC 2020
On 6/19/20 12:35 PM, mercan wrote:
> Hi;
>
> For running jobs, you can get the running script with using:
>
> scontrol write batch_script "$SLURM_JOBID" -
wow, thanks a lot!!!
Adrian
>
> command. the - parameter reqired for screen output.
>
> Ahmet M.
>
>
> On 19.06.2020 12:25, Adrian Sevcenco wrote:
>> On 6/18/20 9:35 AM, Loris Bennett wrote:
>>> Hi Adrain,
>> Hi
>>
>>> Adrian Sevcenco <Adrian.Sevcenco at spacescience.ro> writes:
>>>
>>>> Hi! I'm trying to retrieve the actual executable of jobs but i did
>>>> not find how
>>>> to do it .. i would like to found this for both case when the job is
>>>> started
>>>> with sbatch or with srun.
>>>
>>> For running jobs:
>>>
>>> scontrol show job <job id>
>> well, this was the first thing i tried but i have a null command
>>
>> [root at alien ~]# scontrol show job 2794270
>> JobId=2794270 JobName=AliEn.4865.575
>> UserId=aliprod(1000) GroupId=aliprod(1000) MCS_label=N/A
>> Priority=13338 Nice=0 Account=aliprod QOS=normal WCKey=*
>> JobState=RUNNING Reason=None Dependency=(null)
>> Requeue=0 Restarts=0 BatchFlag=1 Reboot=0 ExitCode=0:0
>> RunTime=09:20:37 TimeLimit=1-00:00:00 TimeMin=N/A
>> SubmitTime=2020-06-19T02:22:45 EligibleTime=2020-06-19T02:22:45
>> AccrueTime=2020-06-19T02:22:45
>> StartTime=2020-06-19T02:32:10 EndTime=2020-06-20T02:32:10 Deadline=N/A
>> SuspendTime=None SecsPreSuspend=0 LastSchedEval=2020-06-19T02:32:10
>> Partition=alien AllocNode:Sid=alien.spacescience.ro:4865
>> ReqNodeList=(null) ExcNodeList=(null)
>> NodeList=alien-0-62
>> BatchHost=alien-0-62
>> NumNodes=1 NumCPUs=1 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
>> TRES=cpu=1,mem=2600M,node=1,billing=1
>> Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*
>> MinCPUsNode=1 MinMemoryCPU=2600M MinTmpDiskNode=0
>> Features=(null) DelayBoot=00:00:00
>> OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
>> Command=(null)
>> WorkDir=/tmp
>> StdErr=/dev/null
>> StdIn=/dev/null
>> StdOut=/dev/null
>> Power=
>>
>> this is slurm 19.05.2 and my purpose is only for running jobs
>>
>> moreover it's not clear to me what are the steps of job submission and
>> the processes involved ..
>>
>> it seems to be that slurmstepd (with identification of jobid.batch)
>> start a slurm_script that i think that is the actually submitted script
>>
>> the job start a srun with a script (specified in submiited script)
>> and at the moment i get an slurmstepd[jobid.0] where this is run
>>
>> so, at this moment it would be enough if given a job id i can get the
>> submission script...
>>
>> is there a way to do it? (beside get the node from squeue and then ssh
>> on node and ps grep?)
>>
>> Thank you!
>> Adrian
>>
>>
>>>
>>> For completed jobs the information about the executable is not kept by
>>> the standard accounting mechanism. However, it is possible to extract
>>> more information yourself from either the prolog or epilog and save this
>>> somewhere.
>>>
>>> Cheers,
>>>
>>> Loris
>>>
>>
More information about the slurm-users
mailing list