[slurm-users] Fwd: srun: error: Unable to allocate resources: Invalid partition name specified
Brian Andrus
toomuchit at gmail.com
Fri Jul 27 08:59:25 MDT 2018
You show you still have more that one partition with Default=YES.
There should one and only one that is set to YES.
That is the one partition that is used if it is not specified.
Brian Andrus
On 7/27/2018 6:34 AM, valeriana at cbpf.br wrote:
> Hi Merlin
>
>> Do you accidentally have more than one partition with Default=YES?
> It was. I changed to NO and I continue with the same error.
>
> [root at master ~]# scontrol show partition
> PartitionName=course
> AllowGroups=courseit AllowAccounts=ALL AllowQos=ALL
> AllocNodes=ALL Default=NO QoS=N/A
> DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=0
> Hidden=NO
> MaxNodes=UNLIMITED MaxTime=UNLIMITED MinNodes=1 LLN=NO
> MaxCPUsPerNode=UNLIMITED
> Nodes=node[02-04,06,09-12]
> PriorityJobFactor=1 PriorityTier=1 RootOnly=NO ReqResv=NO
> OverSubscribe=NO
> OverTimeLimit=NONE PreemptMode=OFF
> State=UP TotalCPUs=64 TotalNodes=8 SelectTypeParameters=NONE
> DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED
>
> PartitionName=test
> AllowGroups=testcluster AllowAccounts=ALL AllowQos=ALL
> AllocNodes=ALL Default=YES QoS=N/A
> DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=0
> Hidden=NO
> MaxNodes=UNLIMITED MaxTime=UNLIMITED MinNodes=1 LLN=NO
> MaxCPUsPerNode=UNLIMITED
> Nodes=node[01,05,07,08]
> PriorityJobFactor=1 PriorityTier=1 RootOnly=NO ReqResv=NO
> OverSubscribe=NO
> OverTimeLimit=NONE PreemptMode=OFF
> State=UP TotalCPUs=32 TotalNodes=4 SelectTypeParameters=NONE
> DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED
>
> /etc/slurm/slurm.conf
> PartitionName=course Nodes=node[02-04,06,09-12] AllowGroups=curseit
> Default=NO MaxTime=INFINITE State=UP
> PartitionName=test Nodes=node[01,05,07,08] AllowGroups=testcluster
> Default=YES MaxTime=INFINITE State=UP
>
> If I have a lot of partitions, how can I set a default partition to a
> distinct groups?
>
> In my slurm.conf file, I think that I have to set Default=YES to all
> main partition to my all distinct partitions
>
> For example:
>
> /etc/slurm/slurm.conf
>
> PartitionName=course Nodes=node[02-04,06,09-12] AllowGroups=curseit
> Default=YES MaxTime=INFINITE State=UP
> PartitionName=courset Nodes=node[13-20] AllowGroups=curseit Default=NO
> MaxTime=INFINITE State=UP
>
> PartitionName=test Nodes=node[01,05,07,08] AllowGroups=testcluster
> Default=YES MaxTime=INFINITE State=UP
> PartitionName=testc Nodes=node[21-30] AllowGroups=testcluster
> Default=NO MaxTime=INFINITE State=UP
>
> Thanks!!!
>
> Valeriana
>
> Citando Merlin Hartley <merlin at mrc-mbu.cam.ac.uk>:
>
>> Do you accidentally have more than one partition with Default=YES?
>>
>>
>> --
>> Merlin Hartley
>> Computer Officer
>> MRC Mitochondrial Biology Unit
>> University of Cambridge
>> Cambridge, CB2 0XY
>> United Kingdom
>>
>>> On 26 Jul 2018, at 16:57, valeriana at cbpf.br wrote:
>>>
>>> Hi all,
>>>
>>> I dont´t understand why its occurs!
>>>
>>> user: john
>>> group: courseit
>>> partition: course
>>>
>>> [john at master ~]$ sinfo
>>> PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
>>> course up infinite 8 idle node[02-04,06,09-12]
>>>
>>> /etc/group
>>> courseit:x:1002:john
>>>
>>> /etc/passwd
>>> john:x:1001:1002::/home/john:/bin/bash
>>>
>>> /etc/slurm/slurm.conf
>>> PartitionName=course Nodes=node[02-04,06,09-12] AllowGroups=courseit
>>> Default=YES MaxTime=INFINITE State=UP
>>>
>>>
>>> [john at master ~]$ srun -N3 -l /bin/hostname
>>> srun: error: Unable to allocate resources: User's group not
>>> permitted to use this partition
>>>
>>> And if I put -p course, it´s ok
>>>
>>> [john at master ~]$ srun -p course -N3 -l /bin/hostname
>>> 2: node04
>>> 1: node03
>>> 0: node02
>>>
>>> Can someone has an idea?
>>>
>>> Thanks in advance!
>>> Valeriana
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>
> ----- Final da mensagem encaminhada -----
>
>
More information about the slurm-users
mailing list