[slurm-users] Slurm Configless error
nicolas.sonoda at versatushpc.com.br
Tue Aug 29 16:02:34 UTC 2023
Thank you for the response!
It's strange because in the slurmctld log, the error I'm getting is:
error: _slurm_rpc_config_request: Rejected request as configless is disabled
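A note on the message above: slurmctld serves configuration files to slurmd only when enable_configless is listed in SlurmctldParameters in the controller's slurm.conf, so this rejection suggests the node is still asking the controller for its configuration. The following is a minimal sketch for checking that setting, not an official tool. The function name configless_enabled is hypothetical, and the parser assumes one Key=Value pair per line, which is simpler than the real slurm.conf grammar.

```python
def configless_enabled(conf_text: str) -> bool:
    """Return True if SlurmctldParameters lists enable_configless.

    Simplified parser (an assumption for illustration): one Key=Value
    pair per line, '#' starts a comment, keys are case-insensitive.
    """
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip().lower() == "slurmctldparameters":
            # The value is a comma-separated list of options.
            options = [opt.strip().lower() for opt in value.split(",")]
            if "enable_configless" in options:
                return True
    return False
```

To use it, read the controller's slurm.conf into a string and pass it in. If the function returns False while slurmd still logs "failed to fetch remote configs", the node is probably not finding or not using its local copy of the configuration.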
From: slurm-users <slurm-users-bounces at lists.schedmd.com> on behalf of Paul Brunk <pbrunk at uga.edu>
Sent: Tuesday, August 29, 2023 12:40
To: Slurm User Community List <slurm-users at lists.schedmd.com>
Subject: Re: [slurm-users] Slurm Configless error
In my experience this usually means the compute node can’t talk to the slurmctld TCP port on the slurm controller (firewall?), or the controller host isn’t resolving the compute node’s name (short hostname vs FQDN, for example). I’d look at slurmctld and slurmd logs—you should see a useful complaint in one of those, whatever the cause.
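The two checks described above (name resolution and reachability of the slurmctld TCP port) can be sketched in Python as follows. This is an illustration under assumptions, not part of Slurm: the function names are hypothetical, and 6817 is used as the port because it is the usual SlurmctldPort default; substitute the value from your slurm.conf if it differs.

```python
import socket


def resolves(name: str) -> bool:
    """Return True if the local resolver can map `name` to an address."""
    try:
        socket.getaddrinfo(name, None)
        return True
    except socket.gaierror:
        return False


def can_connect(host: str, port: int = 6817, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds.

    The default port of 6817 is an assumption (the usual SlurmctldPort).
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False
```

Run can_connect("<controller hostname>") on the compute node, and resolves("n01") as well as resolves with the node's FQDN on the controller. A False from the first points at a firewall or a stopped slurmctld; a False from the second points at the short hostname vs. FQDN problem.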
Paul Brunk, system administrator
Advanced Computing Resource Center
Enterprise IT Svcs, the University of Georgia
On 8/29/23, 11:29 AM, "slurm-users" <slurm-users-bounces at lists.schedmd.com> wrote:
I'm encountering the following errors on my node:
Aug 29 12:24:48 n01 slurmd: error: _fetch_child: failed to fetch remote configs
Aug 29 12:24:48 n01 slurmd: error: _establish_configuration: failed to load configs
Aug 29 12:24:48 n01 slurmd: error: slurmd initialization failed
I disabled Slurm's configless mode, copied the configuration files to the nodes, and restarted the services, but the same errors keep appearing.
Could you please help me with this?