Nick,
Presuming you have followed the SchedMD instructions, you should be able to get a session in your login container:
kubectl --namespace=slurm exec -it statefulsets/slurm-controller -- bash --login
From there, you can do any standard testing you like. Simple 'srun hostname' should work to let you know slurm itself is doing it's part.
You can also do commands such as 'scontrol show nodes' to see what and how many resources you have configured.
Outside that, you need to ensure you have configured your slurm containers to request the resources you plan on using (eg: gpus) and that you have enough of them for the script(s) you wish to run.
Brian Andrus
On 6/6/2025 12:31 PM, Nicolas Bock via slurm-users wrote:
Hi,
I was wondering whether there is any integration testing done on the containers and the operator published under the slinky project. Or asked differently, once I have deployed slinky, what tests should I run to confirm that the cluster is working as intended?
Thanks!
Nick