Just double checking. Can you check on your worker node
1. ls -la /etc/pam.d/*slurm*
(just checking if there's a specific pam file for slurmd on your system)
1. scontrol show config | grep -i SlurmdUser
(checking if slurmd is set up with a different user to SlurmUser)
1. grep slurm /etc/passwd
Sean
________________________________ From: Steven Jones via slurm-users slurm-users@lists.schedmd.com Sent: Tuesday, 4 February 2025 08:56 To: slurm-users@lists.schedmd.com slurm-users@lists.schedmd.com; Christopher Samuel chris@csamuel.org Subject: [EXT] [slurm-users] Re: Fw: Re: RHEL8.10 V slurmctld
External email: Please exercise caution
________________________________ I rebuilt 4 nodes as rocky9.5
8><--- [2025-02-03T21:40:11.978] Node node6 now responding [2025-02-03T21:41:15.698] _slurm_rpc_submit_batch_job: JobId=17 InitPrio=4294901759 usec=501 [2025-02-03T21:41:16.055] sched: Allocate JobId=17 NodeList=node6 #CPUs=1 Partition=debug [2025-02-03T21:41:16.059] Killing non-startable batch JobId=17: Invalid user id [2025-02-03T21:41:16.059] _job_complete: JobId=17 WEXITSTATUS 1 [2025-02-03T21:41:16.060] _job_complete: JobId=17 done
So same error RHEL9.5 to Rocky9.5
🙁
Unless I am missing some sort of config setting, I am out of permutations I can try.
regards
Steven
________________________________ From: Christopher Samuel via slurm-users slurm-users@lists.schedmd.com Sent: Tuesday, 4 February 2025 10:13 am To: slurm-users@lists.schedmd.com slurm-users@lists.schedmd.com Subject: [slurm-users] Re: Fw: Re: RHEL8.10 V slurmctld
On 2/3/25 2:33 pm, Steven Jones via slurm-users wrote:
Just built 4 x rocky9 nodes and I do not get that error (but I get another I know how to fix, I think) so holistically I am thinking the version difference is too large.
Oh I think I missed this - when you say version difference do you mean the Slurm version or the distro version?
I was assuming you were building your Slurm versions yourselves for both, but that may be way off the mark, sorry!
What are the Slurm versions everywhere?
All the best, Chris -- Chris Samuel : https://apc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.csamuel...http://www.csamuel.org/ : Berkeley, CA, USA
-- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-leave@lists.schedmd.com