[slurm-users] backfill scheduler does not work for heterogeneous jobs (version 17.11)

Ana Jokanović anaj82 at gmail.com
Thu Nov 29 05:28:17 MST 2018


Hello,

I did a simple test submitting the workload of three jobs (see below) on a
cluster of 5 nodes:

sbatch --cpus-per-task=2 --ntasks=6 --time=15 : --cpus-per-task=2
--ntasks=6 --time=15 : --cpus-per-task=2 --ntasks=6 --time=15

sbatch --cpus-per-task=2 --ntasks=6 --time=15 : --cpus-per-task=2
--ntasks=6 --time=15 : --cpus-per-task=2 --ntasks=6 --time=15

sleep 5;

sbatch --ntasks=1 --time=2 : --ntasks=1 --time=1

I would expect that the third submitted job is backfilled but it does not
happen.
Here is the job completion log:

JobId=2 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317714 EndTime=1543317774 NodeList=s19r2b09 NodeCnt=1
ProcCnt=48

JobId=3 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317714 EndTime=1543317774 NodeList=s19r2b10 NodeCnt=1
ProcCnt=48

JobId=4 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317714 EndTime=1543317774 NodeList=s19r2b12 NodeCnt=1
ProcCnt=48

JobId=8 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:02:00 SubmitTime=1543317699
StartTime=1543317804 EndTime=1543317824 NodeList=s19r2b14 NodeCnt=1
ProcCnt=48

JobId=9 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:01:00 SubmitTime=1543317699
StartTime=1543317804 EndTime=1543317824 NodeList=s19r2b16 NodeCnt=1
ProcCnt=48

JobId=5 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317804 EndTime=1543317864 NodeList=s19r2b09 NodeCnt=1
ProcCnt=48

JobId=6 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317804 EndTime=1543317864 NodeList=s19r2b10 NodeCnt=1
ProcCnt=48

JobId=7 UserId=3113 GroupId=8950 Name=sleep JobState=COMPLETED
Partition=debug TimeLimit=00:15:00 SubmitTime=1543317694
StartTime=1543317804 EndTime=1543317864 NodeList=s19r2b12 NodeCnt=1
ProcCnt=48


Would you expect this behavior?


Thanks.


Best regards,

Ana
-- 
Ana Jokanovic, PhD
Barcelona Supercomputing Center
c/ Jordi Girona 1-3, K2M Building, 1st floor
08034 Barcelona - SPAIN
e-mail: anaj82 at gmail.com or ana.jokanovic at bsc.es
tel: +34 93 4137246


-- 
Ana Jokanovic, PhD
Barcelona Supercomputing Center
c/ Jordi Girona 1-3, K2M Building, 1st floor
08034 Barcelona - SPAIN
e-mail: anaj82 at gmail.com or ana.jokanovic at bsc.es
tel: +34 93 4137246


-- 
Ana Jokanovic, PhD
Barcelona Supercomputing Center
c/ Jordi Girona 1-3, K2M Building, 1st floor
08034 Barcelona - SPAIN
e-mail: anaj82 at gmail.com or ana.jokanovic at bsc.es
tel: +34 93 4137246
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20181129/e59917ac/attachment.html>


More information about the slurm-users mailing list