[slurm-users] ticking time bomb? launching too many jobs in parallel
Goetz, Patrick G
pgoetz at math.utexas.edu
Thu Aug 29 16:48:36 UTC 2019
On 8/29/19 9:38 AM, Jarno van der Kolk wrote:
> Here's an example on how to do so from the Compute Canada docs:
> https://docs.computecanada.ca/wiki/GNU_Parallel#Running_on_Multiple_Nodes
>
> [name@server ~]$ parallel --jobs 32 --sshloginfile ./node_list_${SLURM_JOB_ID} --env MY_VARIABLE --workdir $PWD ./my_program
To me it looks like you're circumventing the scheduler when you do this;
maybe I'm missing something?
Also, where are these environment variables:
SLURM_JOB_NODELIST, SLURM_JOB_ID
being set?
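
One way to check would be to print both variables from a login shell and
again from inside a job script, then compare the two. A minimal sketch,
assuming a POSIX shell with printenv available; report_slurm_vars is a
hypothetical helper name, not something Slurm provides:

```shell
# Hypothetical helper: report whether each variable is present in the
# environment of the current shell (i.e. exported, not just a shell variable).
report_slurm_vars() {
    for name in SLURM_JOB_ID SLURM_JOB_NODELIST; do
        # printenv exits non-zero when the variable is not in the environment
        if value=$(printenv "$name"); then
            printf '%s=%s\n' "$name" "$value"
        else
            printf '%s is not set\n' "$name"
        fi
    done
}

report_slurm_vars
```

If the variables only show up when this runs inside a job, that would
suggest the scheduler exports them for each allocation rather than the
user setting them by hand.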