[slurm-users] ticking time bomb? launching too many jobs in parallel

Goetz, Patrick G pgoetz at math.utexas.edu
Thu Aug 29 16:48:36 UTC 2019


On 8/29/19 9:38 AM, Jarno van der Kolk wrote:
> Here's an example on how to do so from the Compute Canada docs:
> https://docs.computecanada.ca/wiki/GNU_Parallel#Running_on_Multiple_Nodes
> 

[name at server ~]$ parallel --jobs 32 \
    --sshloginfile ./node_list_${SLURM_JOB_ID} \
    --env MY_VARIABLE --workdir $PWD ./my_program


To me it looks like you're circumventing the scheduler when you do this; 
maybe I'm missing something?

Also, where are these environment variables:

   SLURM_JOB_NODELIST, SLURM_JOB_ID

being set?
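
For reference, Slurm documents both variables as output environment
variables that sbatch and salloc export into the job's environment, so a
command like the quoted one would normally run inside a batch script
rather than on a login node. A minimal sketch, assuming submission with
sbatch: the #SBATCH values, the MY_VARIABLE value, the node_list_file
helper and the ::: arguments are all hypothetical and not taken from the
quoted docs.

```shell
#!/bin/bash
#SBATCH --nodes=2              # hypothetical resource request
#SBATCH --ntasks-per-node=32   # assumed, chosen to match --jobs 32

# Hypothetical helper: name of the per-job host file for a given job ID.
node_list_file() {
    echo "./node_list_$1"
}

# Slurm sets SLURM_JOB_ID and SLURM_JOB_NODELIST only inside an allocation,
# so do nothing useful when the script is run outside of one.
if [ -n "${SLURM_JOB_ID:-}" ]; then
    hosts=$(node_list_file "$SLURM_JOB_ID")

    # Expand the compact node list (e.g. node[01-02]) to one hostname per line.
    scontrol show hostnames "$SLURM_JOB_NODELIST" > "$hosts"

    export MY_VARIABLE=1       # placeholder value, forwarded by --env

    # Same command as quoted above; the arguments after ::: are made up so
    # the sketch has some input (the quoted form reads them from stdin).
    parallel --jobs 32 --sshloginfile "$hosts" --env MY_VARIABLE \
        --workdir "$PWD" ./my_program ::: 1 2 3 4
else
    echo "SLURM_JOB_ID is not set; submit with sbatch or run inside salloc" >&2
fi
```

The host file then lists only nodes that already belong to the job's
allocation. Whether processes started on them over ssh are tracked and
limited by Slurm likely depends on site configuration (pam_slurm_adopt,
for example), so the scheduler question still stands.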




