[slurm-users] What is the complete logic to calculate node number in job_submit.lua
Ole Holm Nielsen
Ole.H.Nielsen at fysik.dtu.dk
Mon Sep 26 13:04:37 UTC 2022
Hi Loris,
On 9/26/22 12:51, Loris Bennett wrote:
>> When designing restriction in job_submit.lua, I found there is no member in job_desc struct can directly be used to determine the node number finally allocated to a job. The job_desc.min_nodes seem to
>> be a close answer, but it will be 0xFFFFFFFE when user not specify –node option. Then in such case we think we can use job_desc.num_tasks and job_desc.ntasks_per_node to calculate node number.
>> But again, we find ntasks_per_node may also be default value 0xFFFE if user not specify related option.
>>
>> So what is the complete and elegant way to predict the job node number in job_submit.lua in all case, no matter how user write their submit options?
>
> I don't think you can expect to know the node(s) a job will eventually
> run on at submission time. How would this work? Resources will become
> available earlier than Slurm expects, if jobs finish before the given
> time-time (or if they crash). If your are using fairshare, jobs can be
> scheduled which have a higher priority than the currently waiting jobs.
>
> What is your use-case for needing to know the node the job will run on?
I think he meant the *number of nodes*, and not the *hostnames* of the
compute nodes selected by Slurm at a later time.
/Ole
More information about the slurm-users
mailing list