[slurm-users] partition with several nodes not following name	pattern
    Vincent Berenz 
    vberenz at tuebingen.mpg.de
       
    Mon Nov 13 06:37:51 MST 2017
    
    
  
Hi,
For example, this configuration in slurm.conf works fine:
   NodeName=kilimanjaro CPUs=16 RealMemory=80419 Sockets=1 
CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN
   PartitionName=slurmtest Nodes=kilimanjaro Default=YES 
MaxTime=INFINITE State=UP
This configuration works also:
   NodeName=falken CPUs=8 SocketsPerBoard=1 CoresPerSocket=4 
ThreadsPerCore=2 RealMemory=64358 State=UNKNOWN
   PartitionName=slurmtest Nodes=falken Default=YES MaxTime=INFINITE 
State=UP
I would like now to use kilimanjaro and falken in the same partition. I 
can not change their hostname. I tried:
   NodeName=n1 NodeHostName=kilimanjaro CPUs=16 RealMemory=80419 
Sockets=1 CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN
   NodeName=n2 NodeHostName=falken CPUs=8 SocketsPerBoard=1 
CoresPerSocket=4 ThreadsPerCore=2 RealMemory=64358 State=UNKNOWN
   PartitionName=slurmtest Nodes=n[1-2] Default=YES MaxTime=INFINITE 
State=UP
But then job fails with error:
   srun: error: Task launch for 58.0 failed on node n1: Invalid job 
credential
   srun: error: Application launch failed: Invalid job credential
   srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
   srun: error: Timed out waiting for job step to complete
Anything I am doing wrong ?
Many thanks
Vincent
    
    
More information about the slurm-users
mailing list