<div dir="ltr"><div>From this tutorial<a href="https://www.brightcomputing.com/blog/bid/174099/slurm-101-basic-slurm-usage-for-linux-clusters">https://www.brightcomputing.com/blog/bid/174099/slurm-101-basic-slurm-usage-for-linux-clusters</a> I am trying to run the below and it always fails. I've made sure to run 'module load slurm'. What could be wrong? Logs from slurmctld show ok:</div><div>[2019-07-09T10:19:44.183] prolog_running_decr: Configuration for JobID=402 is complete<br>[2019-07-09T10:19:44.266] _job_complete: JobID=402 State=0x1 NodeCnt=1 WEXITSTATUS 1<br>[2019-07-09T10:19:44.266] _job_complete: JobID=402 State=0x8005 NodeCnt=1 done<br>[2019-07-09T10:21:31.934] _slurm_rpc_submit_batch_job: JobId=403 InitPrio=4294901690 usec=321<br></div><br>cat slurm-job.sh<br>#!/usr/bin/bash<br><br>#SBATCH -o slurm.sh.out<br>#SBATCH -p defq<br><br>echo "In the directory: `pwd`"<br>echo "As the user: `whoami`"<br>echo "write this is a file" > analysis.output<br>sleep 60<br><br><div>scontrol show job 402<br>JobId=402 JobName=slurm-job.sh<br>   UserId=root(0) GroupId=root(0) MCS_label=N/A<br>   Priority=4294901691 Nice=0 Account=root QOS=normal<br>   JobState=FAILED Reason=NonZeroExitCode Dependency=(null)<br>   Requeue=1 Restarts=0 BatchFlag=1 Reboot=0 ExitCode=1:0<br>   RunTime=00:00:01 TimeLimit=365-00:00:00 TimeMin=N/A<br>   SubmitTime=2019-07-09T10:19:43 EligibleTime=2019-07-09T10:19:43<br>   StartTime=2019-07-09T10:19:43 EndTime=2019-07-09T10:19:44 Deadline=N/A<br>   PreemptTime=None SuspendTime=None SecsPreSuspend=0<br>   LastSchedEval=2019-07-09T10:19:43<br>   Partition=defq AllocNode:Sid=ciscluster:349904<br>   ReqNodeList=(null) ExcNodeList=(null)<br>   NodeList=node001<br>   BatchHost=node001<br>   NumNodes=1 NumCPUs=1 NumTasks=0 CPUs/Task=1 ReqB:S:C:T=0:0:*:*<br>   TRES=cpu=1,node=1,billing=1<br>   Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*<br>   MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0<br>   Features=(null) DelayBoot=00:00:00<br>   Gres=(null) Reservation=(null)<br>   OverSubscribe=YES Contiguous=0 Licenses=(null) Network=(null)<br>   Command=/root/testing/slurm-job.sh<br>   WorkDir=/root/testing<br>   StdErr=/root/testing/slurm.sh.out<br>   StdIn=/dev/null<br>   StdOut=/root/testing/slurm.sh.out<br>   Power=<br></div></div>