[slurm-users] pam_slurm_adopt not working for all users

Tina Friedrich tina.friedrich at it.ox.ac.uk
Fri May 21 15:35:39 UTC 2021


Hi Loris,

I'm not an PAM expert, but - pam_slurm_adopt doesn't do authenticatio, 
it only verifies that access for the authenticated user is allowed (by 
checking there's a job). 'account' not 'auth' in PAM config. As in, it's 
got nothing to do with how the user logs in to the server / is 
authenticated by the server.

So yes, I'd expect this. For SSH logins to work, users need to, well, be 
able to log in via ssh. Key based, password auth, host-based SSH, 
Kerberos, ... - whatever auth mechanism your PAM config's configured to 
use (or whatever you've configured in sshd_config).

If this is simply about quickly accessing nodes that they have jobs on 
to check on them - we tell our users to 'srun' into a job allocation 
(srun --jobid=XXXXXX).

Tina

On 21/05/2021 13:53, Loris Bennett wrote:
> Hi,
> 
> We have set up pam_slurm_adopt using the official Slurm documentation
> and Ole's information on the subject.  It works for a user who has SSH
> keys set up, albeit the passphrase is needed:
> 
>    $ salloc --partition=gpu --gres=gpu:1 --qos=hiprio --ntasks=1 --time=00:30:00 --mem=100
>    salloc: Granted job allocation 7202461
>    salloc: Waiting for resource configuration
>    salloc: Nodes g003 are ready for job
> 
>    $ ssh g003
>    Warning: Permanently added 'g003' (ECDSA) to the list of known hosts.
>    Enter passphrase for key '/home/loris/.ssh/id_rsa':
>    Last login: Wed May  5 08:50:00 2021 from login.curta.zedat.fu-berlin.de
> 
>    $ ssh g004
>    Warning: Permanently added 'g004' (ECDSA) to the list of known hosts.
>    Enter passphrase for key '/home/loris/.ssh/id_rsa':
>    Access denied: user loris (uid=182317) has no active jobs on this node.
>    Access denied by pam_slurm_adopt: you have no active jobs on this node
>    Authentication failed.
> 
> If SSH keys are not set up, then the user is asked for a password:
> 
>    $ squeue --me
>                 JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
>               7201647      main test_job nokeylee  R    3:45:24      1 c005
>               7201646      main test_job nokeylee  R    3:46:09      1 c005
>    $ ssh c005
>    Warning: Permanently added 'c005' (ECDSA) to the list of known hosts.
>    nokeylee at c005's password:
> 
> My assumption was that a user should be able to log into a node on which
> that person has a running job without any further ado, i.e. without the
> necessity to set up anything else or to enter any credentials.
> 
> Is this assumption correct?
> 
> If so, how can I best debug what I have done wrong?
> 
> Cheers,
> 
> Loris
> 

-- 
Tina Friedrich, Advanced Research Computing Snr HPC Systems Administrator

Research Computing and Support Services
IT Services, University of Oxford
http://www.arc.ox.ac.uk http://www.it.ox.ac.uk



More information about the slurm-users mailing list