[slurm-users] [slurm 17.02] select/cray plugin from non-crays
Andrew Elwell
andrew.elwell at gmail.com
Wed Jan 10 22:15:29 MST 2018
Hi folks,
We've just upgraded to Slurm 17.02.9 (native) on our Crays, but can't
get sinfo to work against them anymore from a non-Cray host:
"sinfo: error: Cluster 'galaxy' has an unknown select plugin_id 108"
On the Crays we have
aelwell@galaxy-int:~/testjobs/native$ grep -i select /etc/opt/slurm/slurm.conf
SelectType=select/cray
SelectTypeParameters=CR_ONE_TASK_PER_CORE,CR_CORE_Memory,other_cons_res
aelwell@galaxy-int:~/testjobs/native$
and on the non-Cray node I'm trying to get working (an admin node we
use for monitoring the job queues across the site):
hpc-admin2:~ # zypper up
Refreshing service 'SMT-https_target_pawsey_org_au'.
Loading repository data...
Reading installed packages...
Nothing to do.
hpc-admin2:~ # rpm -qa | grep slurm
slurm-munge-17.02.9-6.10.1.x86_64
slurm-17.02.9-6.10.1.x86_64
slurm-plugins-17.02.9-6.10.1.x86_64
hpc-admin2:~ # ldd /usr/lib64/slurm/select_cray.so
linux-vdso.so.1 (0x00007ffdf9db1000)
libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fcd37d89000)
libc.so.6 => /lib64/libc.so.6 (0x00007fcd379e6000)
/lib64/ld-linux-x86-64.so.2 (0x00007fcd381b9000)
hpc-admin2:~ # sinfo --version
slurm 17.02.9
hpc-admin2:~ # sacctmgr show clusters | grep gala
galaxy 146.118.55.132 6817 7936 1 normal
hpc-admin2:~ # sinfo -M galaxy
sinfo: error: Cluster 'galaxy' has an unknown select plugin_id 108
sinfo: error: 'galaxy' can't be reached now, or it is an invalid entry for --cluster. Use 'sacctmgr list clusters' to see available clusters.
hpc-admin2:~ #
Is there a workaround to get non-Cray hosts talking to Cray slurmctlds?
Many thanks,
Andrew