[slurm-users] Slurm - Munge configuration details
Benson Muite
benson_muite at emailplus.org
Tue Feb 2 12:40:10 UTC 2021
On 2/2/21 3:30 PM, Zainul Abiddin wrote:
> Hi All,
> I am new to Slurm and am trying to set up Slurm 20.11.2 on CentOS 7.
> My environment is a master node (smaster) plus a compute node (snode),
> and I am using
> https://www.slothparadise.com/how-to-install-slurm-on-centos-7-cluster/
> to set up Slurm on the master and compute nodes.
>
> I have installed Munge on both nodes and it is running fine.
> However, when I try to run the munge command from the master to the
> node, it asks for a password.
>
> export MUNGEUSER=1001
> groupadd -g $MUNGEUSER munge
> useradd -m -c "MUNGE Uid 'N' Gid Emporium" -d /var/lib/munge -u $MUNGEUSER -g munge -s /sbin/nologin munge
> export SlurmUSER=1002
> groupadd -g $SlurmUSER slurm
> useradd -m -c "Slurm workload manager" -d /var/lib/slurm -u $SlurmUSER -g slurm -s /bin/bash slurm
>
> yum install -y epel-release
> yum install munge munge-libs munge-devel -y
>
> yum install rng-tools -y
> rngd -r /dev/urandom
> /usr/sbin/create-munge-key -r
> dd if=/dev/urandom bs=1 count=1024 > /etc/munge/munge.key
> chown munge: /etc/munge/munge.key
> chmod 400 /etc/munge/munge.key
>
> scp /etc/munge/munge.key root@snode:/etc/munge
>
> chown munge: /etc/munge/munge.key
> chmod 400 /etc/munge/munge.key
> chown -R munge: /etc/munge/ /var/log/munge/
> chmod 0700 /etc/munge/ /var/log/munge/
>
> systemctl enable munge
> systemctl start munge
> systemctl status munge
>
> [root@smaster ~]# systemctl status munge
> ● munge.service - MUNGE authentication service
>    Loaded: loaded (/usr/lib/systemd/system/munge.service; enabled; vendor preset: disabled)
>    Active: active (running) since Mon 2021-02-01 12:52:54 IST; 1h 4min ago
>      Docs: man:munged(8)
>   Process: 2547 ExecStart=/usr/sbin/munged (code=exited, status=0/SUCCESS)
>  Main PID: 2550 (munged)
>     Tasks: 4
>    CGroup: /system.slice/munge.service
>            └─2550 /usr/sbin/munged
>
> Feb 01 12:52:54 smaster.calligotech.com systemd[1]: Starting MUNGE authentication service...
> Feb 01 12:52:54 smaster.calligotech.com systemd[1]: Started MUNGE authentication service.
> [root@smaster ~]# munge -n
> MUNGE:AwQDAAAg5PQzQhz/D4h7OGUU4Cx4QAgZ4z/0MMt0SP+uhuP927Xcl2t8EC4izsUj6xpMRslnIb2g4RCz2vayu0wW1o8mNNuy7cVv/PmsuO9XsAJ7aLl1n/M=:
> [root@smaster ~]#
>
> Below are the screenshots for reference.
> Smaster: image.png
> Snode: image.png
>
> Am I configuring this properly, or do I need to set up passwordless
> authentication from the master to the node and vice versa?
>
> Please clarify whether Munge provides passwordless login, or whether
> we need to set up passwordless login separately.
>
> Please point me to a proper setup guide that covers Munge
> configuration, Slurm accounting database daemon configuration, and
> Slurm installation and configuration, including testing simple jobs
> on the master and compute nodes.
>
> --
> *Regards*
> *Zain*
>
Are you able to do passwordless ssh between the nodes?
You may also find the following helpful:
https://github.com/dun/munge/wiki/Installation-Guide
https://southgreenplatform.github.io/trainings/hpc/slurminstallation/
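
To separate the ssh question from the Munge question, a quick check along
the lines of the MUNGE installation guide may help. This is only a sketch
under some assumptions: check_munge is a hypothetical helper name (it is not
part of MUNGE), munge, unmunge and ssh are assumed to be on the PATH, and the
host name passed in (snode in this thread) is assumed to resolve from the
master.

```shell
#!/bin/sh
# Sketch only: check_munge is a hypothetical helper, not a MUNGE command.
# Usage: check_munge <remote-host>
check_munge() {
    host="$1"

    # 1. Encode and decode a credential on the local node.
    if munge -n | unmunge >/dev/null 2>&1; then
        echo "local: ok"
    else
        echo "local: FAILED"
    fi

    # 2. Encode locally, decode on the remote node. This should only
    #    succeed when both munged daemons use the same key.
    #    Any password prompt at this step comes from ssh, not from MUNGE.
    if munge -n | ssh "$host" unmunge >/dev/null 2>&1; then
        echo "remote ($host): ok"
    else
        echo "remote ($host): FAILED"
    fi
}
```

Running `check_munge snode` on smaster would print one line per check. If
the local check passes but the remote one fails, the key on snode probably
differs from the one on smaster, or munged is not running there. A password
prompt during the second check just means ssh keys are not set up yet; ssh
here is only the transport for the test.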