[slurm-users] Slurm 19.05: can not submit job
Ole Holm Nielsen
Ole.H.Nielsen at fysik.dtu.dk
Thu Nov 28 10:19:56 UTC 2019
On 11/28/19 10:35 AM, Nguyen Dai Quy wrote:
> Hi list,
> I can not submit my job:
> > sbatch submit.sh
> sbatch: error: Batch job submission failed: Invalid account or
> account/partition combination specified
>
> After checking slurmdbd.log, I see:
>
> [2019-11-28T10:21:07.578] Accounting storage MYSQL plugin loaded
> [2019-11-28T10:21:07.586] slurmdbd version 19.05.4 started
> [2019-11-28T10:26:07.778] error: _add_registered_cluster: trying to
> register a cluster (cluster3) with no remote port
> [2019-11-28T10:30:14.936] Terminate signal (SIGINT or SIGTERM) received
> [2019-11-28T10:30:14.951] Unable to remove pidfile
> '/var/run/slurmdbd.pid': Permission denied
> [2019-11-28T10:30:15.038] Accounting storage MYSQL plugin loaded
> [2019-11-28T10:30:15.047] slurmdbd version 19.05.4 started
> [2019-11-28T10:31:07.997] error: _add_registered_cluster: trying to
> register a cluster (cluster3) with no remote port
>
> I used slurm 19.05 on CentOS 7.
> Any suggestions?
This could perhaps be a firewall problem. I suggest that you try to
validate your configuration by following my Slurm Wiki:
https://wiki.fysik.dtu.dk/niflheim/SLURM
In particular, you have to configure the CentOS 7 firewall correctly:
https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#configure-firewall-for-slurm-daemons
https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#firewall-between-slurmctld-and-slurmdbd
/Ole
More information about the slurm-users
mailing list