[slurm-users] error: user <UID> not found
Brian Andrus
toomuchit at gmail.com
Wed Sep 30 01:49:44 UTC 2020
That means the system has no idea who that user is.
If you are using /etc/passwd, that file is not synched on the slurm
master node(s)
If you are part of a domain or other shared directory (ldap, etc), your
master is likely not configured right.
If you are using SSSD, it is also possible your sssd has too long of a
cache time. Run "sss_cache -E" to clear everything.
Note: that doesn't always work well, so you can stop sssd, delete the
cache and restart.
If you have a forest, it could be the information has not propagated to
all the servers, so you have to wait.
I've been places where that can take 24 hours.
Brian Andrus
On 9/29/2020 6:18 AM, Diego Zuccato wrote:
> Hello all.
>
> One of the users is unable to submit jobs to our cluster.
>
> The first time he tries, he gets
> $ sbatch test.job
> sbatch: fatal: Invalid user id: 621049927
> then:
> $ sbatch test.job
> sbatch: error: Batch job submission failed: Invalid account or
> account/partition combination specified
>
> In slurmctl.log I see:
> [2020-09-29T13:55:20.018] error: User 621049927 not found
> [2020-09-29T13:55:20.019] _job_create: invalid account or partition for
> user 621049927, account '(null)', and partition 'matrix'
> [2020-09-29T13:55:20.019] _slurm_rpc_submit_batch_job: Invalid account
> or account/partition combination specified
>
> (621049927 is the user's correct UID).
>
> I suspected a PBIS glitch, but both "id studenti#name.surname" and "id
> UID" returned the correct info.
>
> I found an old thread suggesting to add DefaultAccount, and I tried that
> even if it was already done by the script that imports AD users in Slurm
> accounting.
> # sacctmgr modify user studenti#name.surname cluster=oph set
> DefaultAccount=materia
>
> The db seems OK:
> # sacctmgr list user STUDENTI#name.surname WithAssoc
> User Def Acct Admin Cluster Account Partition
> Share MaxJobs MaxNodes MaxCPUs MaxSubmit MaxWall MaxCPUMins
> QOS Def QOS
> ---------- ---------- --------- ---------- ---------- ----------
> --------- ------- -------- -------- --------- ----------- -----------
> -------------------- ---------
> studenti#+ materia None oph materia
> 2 20
> normal
>
> What am I missing?
>
> Tks.
>
More information about the slurm-users
mailing list