<div dir="ltr"><div class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div style="font-size:12.800000190734863px">Both my front end and my compute node have the same UID and GID, but I am getting the same error as in <a href="https://github.com/CSCfi/ansible-role-slurm/issues/11">slurm uid and gid must be consistent across the cluster</a>. How can I fix this problem? </div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"><b>My front end:</b></div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"> $ id </div><div style="font-size:12.800000190734863px"> uid=1000(alper) gid=1003(alper) groups=1003(alper),27(sudo),999(docker)</div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"><b>My compute node:</b> (I updated its GID; it was 1001 before. I am not sure whether Slurm sees the updated value.)</div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"> $ id</div><div style="font-size:12.800000190734863px"> uid=1000(alper) gid=1003(alper) groups=1003(alper),4(adm),30(dip),44(video),46(plugdev),1000(google-sudoers)</div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px">--------------</div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"><b>Log from slurmd:</b></div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"> slurmd: debug2: got this type of message 4005</div><div style="font-size:12.800000190734863px"> slurmd: debug2: Processing RPC: REQUEST_BATCH_JOB_LAUNCH</div><div style="font-size:12.800000190734863px"> slurmd: error: Security violation, batch launch RPC from uid 1000</div><div style="font-size:12.800000190734863px"> slurmd: 
debug3: in the service_connection</div><div style="font-size:12.800000190734863px"> slurmd: debug2: got this type of message 6011</div><div style="font-size:12.800000190734863px"> slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB</div><div style="font-size:12.800000190734863px"> slurmd: debug: _rpc_terminate_job, uid = 1000</div><div style="font-size:12.800000190734863px"> slurmd: error: Security violation: kill_job(26) from uid 1000</div><div style="font-size:12.800000190734863px"> slurmd: debug3: in the service_connection</div><div style="font-size:12.800000190734863px"> slurmd: debug3: in the service_connection</div><div style="font-size:12.800000190734863px"> slurmd: debug2: got this type of message 6011</div><div style="font-size:12.800000190734863px"> slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB</div><div style="font-size:12.800000190734863px"> slurmd: debug: _rpc_terminate_job, uid = 1000</div><div style="font-size:12.800000190734863px"> slurmd: error: Security violation: kill_job(24) from uid 1000</div><div style="font-size:12.800000190734863px"> slurmd: debug2: got this type of message 6011</div><div style="font-size:12.800000190734863px"> slurmd: debug2: Processing RPC: REQUEST_TERMINATE_JOB</div><div style="font-size:12.800000190734863px"> slurmd: debug: _rpc_terminate_job, uid = 1000</div><div style="font-size:12.800000190734863px"> slurmd: error: Security violation: kill_job(25) from uid 1000</div><div style="font-size:12.800000190734863px"> </div><div style="font-size:12.800000190734863px"> slurmd: debug3: in the service_connection</div><div style="font-size:12.800000190734863px"> slurmd: debug2: got this type of message 1008</div><div style="font-size:12.800000190734863px"> slurmd: error: Security violation, ping RPC from uid 1000</div><div style="font-size:12.800000190734863px"> slurmd: error: Do you have SlurmUser configured as uid 1000?</div><div style="font-size:12.800000190734863px"><br></div><div 
style="font-size:12.800000190734863px"><b>Log from slurmctld:</b></div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"> slurmctld: debug2: node_did_resp instance-3</div><div style="font-size:12.800000190734863px"> slurmctld: debug2: agent maximum delay 1 seconds</div><div style="font-size:12.800000190734863px"> slurmctld: debug2: Tree head got back 1</div><div style="font-size:12.800000190734863px"> slurmctld: agent/is_node_resp: node:instance-3 RPC:REQUEST_TERMINATE_JOB : Invalid user id</div><div style="font-size:12.800000190734863px"> slurmctld: debug: node_not_resp: node instance-3 responded since msg sent</div><div style="font-size:12.800000190734863px"><br></div><div style="font-size:12.800000190734863px"> </div></div></div></div></div></div></div></div>
</div>