[slurm-users] slurm: error: Security violation, ping RPC from uid 1000

Alper Alimoglu alper.alimoglu at gmail.com
Mon May 14 09:08:21 MDT 2018


Both my frontEnd and compute node has the same `UID` and `GID` but I am
having same error on [slurm uid and gid must be consistent across the
cluster <https://github.com/CSCfi/ansible-role-slurm/issues/11>]. How can I
fix this problem?

**My frontEnd:**

    $id
    uid=1000(alper) gid=1003(alper) groups=1003(alper),27(sudo),999(docker)

**My compute node:** (I have updated its gid it was 1001 before. I am not
sure whether slurm sees its updated version or not.)

    $id
    uid=1000(alper) gid=1003(alper) groups=1003(alper),4(adm),30(
dip),44(video),46(plugdev),1000(google-sudoers)

--------------

*Log from slurmd:*


    slurmd: debug2: got this type of message 4005
    slurmd: debug2: Processing RPC: REQUEST_BATCH_JOB_LAUNCH
    slurmd: error: Security violation, batch launch RPC from uid 1000
    slurmd: debug3: in the service_connection
    slurmd: debug2: got this type of message 6011
    slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB
    slurmd: debug:  _rpc_terminate_job, uid = 1000
    slurmd: error: Security violation: kill_job(26) from uid 1000
    slurmd: debug3: in the service_connection
    slurmd: debug3: in the service_connection
    slurmd: debug2: got this type of message 6011
    slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB
    slurmd: debug:  _rpc_terminate_job, uid = 1000
    slurmd: error: Security violation: kill_job(24) from uid 1000
    slurmd: debug2: got this type of message 6011
    slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB
    slurmd: debug:  _rpc_terminate_job, uid = 1000
    slurmd: error: Security violation: kill_job(25) from uid 1000

    slurmd: debug3: in the service_connection
    slurmd: debug2: got this type of message 1008
    slurmd: error: Security violation, ping RPC from uid 1000
    slurmd: error: Do you have SlurmUser configured as uid 1000?

*Log from slurmctld:*

    slurmctld: debug2: node_did_resp instance-3
    slurmctld: debug2: agent maximum delay 1 seconds
    slurmctld: debug2: Tree head got back 1
    slurmctld: agent/is_node_resp: node:instance-3
RPC:REQUEST_TERMINATE_JOB : Invalid user id
    slurmctld: debug:  node_not_resp: node instance-3 responded since msg
sent
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180514/e317ef1a/attachment.html>


More information about the slurm-users mailing list