[slurm-users] Heterogeneous job one MPI_COMM_WORLD

Christopher Benjamin Coffey Chris.Coffey at nau.edu
Wed Oct 10 08:27:46 MDT 2018


That is interesting. It is disabled in 17.11.10:

static bool _enable_pack_steps(void)
{
        bool enabled = false;
        char *sched_params = slurm_get_sched_params();

        if (sched_params && strstr(sched_params, "disable_hetero_steps"))
                enabled = false;
        else if (sched_params && strstr(sched_params, "enable_hetero_steps"))
                enabled = true;
        else if (mpi_type && strstr(mpi_type, "none"))
                enabled = true;
        xfree(sched_params);
        return enabled;
}

I wonder if it is ill-advised to enable it? I suppose I could try it. Thanks, Chris!
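
To make the precedence in that function easier to see, here is a minimal standalone sketch of the same decision logic. It is not Slurm source: the name hetero_steps_enabled and its two string parameters are hypothetical stand-ins for the values srun reads from the configuration, so treat it as an illustration only.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Standalone sketch, not Slurm source. The two settings are passed in as
 * plain strings (hypothetical parameters) instead of being read from the
 * running configuration.
 *
 * Precedence, mirroring the quoted function:
 *   1. "disable_hetero_steps" in the scheduler parameters -> disabled
 *   2. "enable_hetero_steps" in the scheduler parameters  -> enabled
 *   3. MPI type containing "none"                         -> enabled
 *   4. anything else                                      -> disabled
 */
bool hetero_steps_enabled(const char *sched_params, const char *mpi_type)
{
    if (sched_params && strstr(sched_params, "disable_hetero_steps"))
        return false;
    if (sched_params && strstr(sched_params, "enable_hetero_steps"))
        return true;
    if (mpi_type && strstr(mpi_type, "none"))
        return true;
    return false;
}
```

Read this way, an explicit disable_hetero_steps wins over everything else, an explicit enable_hetero_steps wins over the MPI type, and with neither option set the steps are only enabled when the MPI type contains "none".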

Best,
Chris

—
Christopher Coffey
High-Performance Computing
Northern Arizona University
928-523-1167
 

On 10/10/18, 12:11 AM, "slurm-users on behalf of Chris Samuel" <slurm-users-bounces at lists.schedmd.com on behalf of chris at csamuel.org> wrote:

    On 10/10/18 05:07, Christopher Benjamin Coffey wrote:
    
    > Yet, we get an error: "srun: fatal: Job steps that span multiple
    > components of a heterogeneous job are not currently supported". But
    > the docs seem to indicate it should work?
    
    Which version of Slurm are you on?  It was disabled by default in
    17.11.x (and I'm not even sure it works if you enable it there) and
    seems to be enabled by default in 18.08.x.
    
    To check, see the _enable_pack_steps() function in src/srun/srun.c
    
    All the best,
    Chris (currently away in the UK)
    -- 
      Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC
    
    


