[slurm-users] partition with several nodes not following name pattern
Vincent Berenz
vberenz at tuebingen.mpg.de
Mon Nov 13 06:37:51 MST 2017
Hi,
For example, this configuration in slurm.conf works fine:
NodeName=kilimanjaro CPUs=16 RealMemory=80419 Sockets=1 CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN
PartitionName=slurmtest Nodes=kilimanjaro Default=YES MaxTime=INFINITE State=UP
This configuration also works:
NodeName=falken CPUs=8 SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=2 RealMemory=64358 State=UNKNOWN
PartitionName=slurmtest Nodes=falken Default=YES MaxTime=INFINITE State=UP
I would now like to use kilimanjaro and falken in the same partition. I
cannot change their hostnames. I tried:
NodeName=n1 NodeHostName=kilimanjaro CPUs=16 RealMemory=80419 Sockets=1 CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN
NodeName=n2 NodeHostName=falken CPUs=8 SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=2 RealMemory=64358 State=UNKNOWN
PartitionName=slurmtest Nodes=n[1-2] Default=YES MaxTime=INFINITE State=UP
But then the job fails with this error:
srun: error: Task launch for 58.0 failed on node n1: Invalid job credential
srun: error: Application launch failed: Invalid job credential
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
srun: error: Timed out waiting for job step to complete
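For comparison, here is a sketch of the same partition that keeps the real
hostnames as node names instead of introducing n1 and n2. It assumes that
Nodes= accepts a comma-separated list of node names, as the slurm.conf man
page describes. It has not been tested on these machines, and whether it
avoids the error above is not known:
NodeName=kilimanjaro CPUs=16 RealMemory=80419 Sockets=1 CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN
NodeName=falken CPUs=8 SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=2 RealMemory=64358 State=UNKNOWN
PartitionName=slurmtest Nodes=kilimanjaro,falken Default=YES MaxTime=INFINITE State=UP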
Am I doing anything wrong?
Many thanks
Vincent