[slurm-users] Having issue in running Job using tensorflow

sudhagar s sudhagar2k5 at gmail.com
Tue Apr 16 14:12:54 UTC 2019


sh-4.3# srun  -N 2 -n 40 -t 24:00:00 job.sh
srun: error: timeout waiting for task launch, started 0 of 40 tasks
srun: Job step 13.0 aborted before step completely launched.
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
slurmstepd: error: *** STEP 13.0 ON ozd2485u CANCELLED AT 2019-04-16T19:35:26 ***
srun: error: Timed out waiting for job step to complete