[slurm-users] Building Slurm with UCX support

Matthias Leopold matthias.leopold at meduniwien.ac.at
Wed Jan 12 20:28:08 UTC 2022



Am 12.01.22 um 17:54 schrieb Matthias Leopold:
> Hi,
> 
> I'm compiling Slurm with ansible playbooks from NVIDIA deepops framework 
> (https://github.com/NVIDIA/deepops). I'm trying to add UCX support. How 
> can I tell if UCX is actually included in the resulting binaries 
> (without actually using Slurm)? I was looking at executables and *so 
> files with ldd, but found no reference to the UCX installation in 
> /usr/lib/ucx.
> 
> Background:
> - I'm struggling with the build system using a non-existent path 
> (PMIXP_UCX_LIBPATH=\"/usr/lib64\"). The last ugly hack was to create a 
> symlink from /usr/lib/ucx to /usr/lib64/ucx
> - I can't easily test actual operation of MPI with UCX because I'm on a 
> limited test/dev system and (frankly) because I'm not yet a MPI expert
> 
> The configure string used is:
> ./configure --prefix=/usr/local --disable-dependency-tracking 
> --disable-debug --disable-x11 --enable-really-no-cray 
> --enable-salloc-kill-cmd --with-hdf5=no --sysconfdir=/etc/slurm 
> --enable-pam --with-pam_dir=/lib/x86_64-linux-gnu/security 
> --with-shared-libslurm --without-rpath --with-pmix=/opt/deepops/pmix 
> --with-hwloc=/opt/deepops/hwloc --with-ucx=/usr
> 

it looks like a similar problem with the "/usr/lib64" path was reported 
in https://bugs.schedmd.com/show_bug.cgi?id=10288
I'm seeing exactly what is reported in comment 5 + 6
unfortunately it's not clear if the problem was resolved

Matthias



More information about the slurm-users mailing list