Hi,
I have munge running on the controller and nodes fine, all tests passed.
I have slurmctld running on the controller ok after checking the logs /var/spool/slurmctld was not created which I assume should have happened via the rpm install?
Anyway I cant get slurmd to run on the warewulf nodes and cant find any log to check?
How to fault find this?
regards
Steven
Hi Steven,
On 03-12-2024 19:34, Steven Jones via slurm-users wrote:
I have munge running on the controller and nodes fine, all tests passed.
I have slurmctld running on the controller ok after checking the logs / var/spool/slurmctld was not created which I assume should have happened via the rpm install?
Anyway I cant get slurmd to run on the warewulf nodes and cant find any log to check?
How to fault find this?
It seems you're missing some extra steps. You may find some useful information in this Wiki page: https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_installation/#installing-rpm...
IHTH, Ole
HI,
Does the slurm user need to be <1000UID? Using IPA with a UID of
[root@vuwunicoslurmd1 slurm]# id slurm uid=126209577(slurm) gid=126209576(slurm) groups=126209576(slurm)
regards
Steven
________________________________ From: Ole Holm Nielsen via slurm-users slurm-users@lists.schedmd.com Sent: Wednesday, 4 December 2024 7:46 am To: slurm-users@lists.schedmd.com slurm-users@lists.schedmd.com Subject: [slurm-users] Re: slurmd on a warwwulf node - not running
[You don't often get email from slurm-users@lists.schedmd.com. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ]
Hi Steven,
On 03-12-2024 19:34, Steven Jones via slurm-users wrote:
I have munge running on the controller and nodes fine, all tests passed.
I have slurmctld running on the controller ok after checking the logs / var/spool/slurmctld was not created which I assume should have happened via the rpm install?
Anyway I cant get slurmd to run on the warewulf nodes and cant find any log to check?
How to fault find this?
It seems you're missing some extra steps. You may find some useful information in this Wiki page: https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwiki.fysik...https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_installation/#installing-rpms
IHTH, Ole
-- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-leave@lists.schedmd.com
Hi, No it doesn’t need to be below 1000. Best Andreas
Am 03.12.2024 um 22:08 schrieb Steven Jones via slurm-users slurm-users@lists.schedmd.com:
HI,
Does the slurm user need to be <1000UID? Using IPA with a UID of
[root@vuwunicoslurmd1 slurm]# id slurm uid=126209577(slurm) gid=126209576(slurm) groups=126209576(slurm)
regards
Steven
________________________________ From: Ole Holm Nielsen via slurm-users slurm-users@lists.schedmd.com Sent: Wednesday, 4 December 2024 7:46 am To: slurm-users@lists.schedmd.com slurm-users@lists.schedmd.com Subject: [slurm-users] Re: slurmd on a warwwulf node - not running
[You don't often get email from slurm-users@lists.schedmd.com. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ]
Hi Steven,
On 03-12-2024 19:34, Steven Jones via slurm-users wrote:
I have munge running on the controller and nodes fine, all tests passed.
I have slurmctld running on the controller ok after checking the logs / var/spool/slurmctld was not created which I assume should have happened via the rpm install?
Anyway I cant get slurmd to run on the warewulf nodes and cant find any log to check?
How to fault find this?
It seems you're missing some extra steps. You may find some useful information in this Wiki page: https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwiki.fysik...https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_installation/#installing-rpms
IHTH, Ole
-- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-leave@lists.schedmd.com
-- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-leave@lists.schedmd.com