[slurm-users] Dependencies problem with cfncluster

david martin vilanew at gmail.com
Wed Dec 20 07:33:17 MST 2017


Hi folks,

Not sure if I should post this here, but I thought you may have seen this 
problem before.


I'm running Slurm (16.05) together with cfncluster from AWS, using 
autoscaling. Everything seems to work except for job dependencies.


I always get an error:

sbatch: error: Batch job submission failed: Job dependency problem


I tested with a simple script:


#!/bin/sh

id=`sbatch --job-name=factor9-1 --ntasks=1 --ntasks-per-core=1 --output=out.slurmout jobscript`
echo "ntasks 1 jobid $id"

for n in 2 4 8 16 32 64 128; do
     id=`sbatch --depend=afterany:$id --job-name=factor9-$n --ntasks=$n --ntasks-per-core=1 --output=$n.slurmout jobscript`;
     echo "ntasks $n jobid $id"
done


jobscript file:

#! /bin/bash

# Print the name of the node the job runs on ($hostname is not a set variable)
hostname


It looks like cfncluster is not aware of job dependencies. Or is this a Slurm 
problem?
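
One thing that may be worth ruling out first (an assumption on my side, not something confirmed for this setup): sbatch normally prints a full sentence, "Submitted batch job <id>", rather than a bare id. If so, the backtick capture in the test script would pass that whole sentence to --depend. A minimal sketch of keeping only the numeric id follows; extract_jobid is a hypothetical helper name, and the sbatch output is simulated so the snippet runs without a cluster.

```shell
#!/bin/sh
# Hypothetical helper (not part of Slurm): keep only the job id from
# sbatch's usual "Submitted batch job <id>" message.
extract_jobid() {
    # Print the last whitespace-separated field of the message.
    printf '%s\n' "$1" | awk '{print $NF}'
}

# Simulated sbatch output (assumption: this is the message format in use).
msg="Submitted batch job 1234"
id=$(extract_jobid "$msg")

# The dependency argument now contains only the id.
echo "--depend=afterany:$id"
```

In the loop this would wrap the real submission, for example id=$(extract_jobid "$(sbatch ...)"). If the installed sbatch supports the --parsable option, that should print the id directly and make the helper unnecessary.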


Thanks,


David
