[slurm-users] Slurm version 20.11.6 is now available

Tim Wickberg tim at schedmd.com
Tue Apr 27 20:36:38 UTC 2021

We are pleased to announce the availability of Slurm version 20.11.6.

This includes a number of minor-to-moderate severity fixes, as well as 
improvements to the recently added job_container/tmpfs plugin.

Slurm can be downloaded from https://www.schedmd.com/downloads.php .

- Tim

Tim Wickberg
Chief Technology Officer, SchedMD LLC
Commercial Slurm Development and Support

> * Changes in Slurm 20.11.6
> ==========================
>  -- Fix sacct assert with the --qos option.
>  -- Use pkg-config --atleast-version instead of --modversion for systemd.
>  -- common/fd - fix getsockopt() call in fd_get_socket_error().
>  -- Properly handle the return from fd_get_socket_error() in _conn_readable().
>  -- cons_res - Fix issue where running jobs were not taken into consideration
>     when creating a reservation.
>  -- Avoid a deadlock between job_list for_each and assoc QOS_LOCK.
>  -- Fix TRESRunMins usage for partition qos on restart/reconfig.
>  -- Fix printing of number of tasks on a completed job that didn't request
>     tasks.
>  -- Fix updating GrpTRESRunMins when decrementing job time is bigger than it.
>  -- Make it so we handle multithreaded allocations correctly when doing
>     --exclusive or --core-spec allocations.
>  -- Fix incorrect round-up division in _pick_step_cores
>  -- Use appropriate math to adjust cpu counts when --ntasks-per-core=1.
>  -- cons_tres - Fix consideration of power downed nodes.
>  -- cons_tres - Fix DefCpuPerGPU, increase cpus-per-task to match with
>     gpus-per-task * cpus-per-gpu.
>  -- Fix under-cpu memory auto-adjustment when MaxMemPerCPU is set.
>  -- Make it possible to override CR_CORE_DEFAULT_DIST_BLOCK.
>  -- Perl API - fix retrieving/storing of slurm_step_id_t in job_step_info_t.
>  -- Recover state of burst buffers when slurmctld is restarted to avoid skipping
>     burst buffer stages.
>  -- Fix race condition in burst buffer plugin which caused a burst buffer
>     in stage-in to not get state saved if slurmctld stopped.
>  -- auth/jwt - print an error if jwt_file= has not been set in slurmdbd.
>  -- Fix RESV_DEL_HOLD not being a valid state when using squeue --states.
>  -- Add missing squeue selectable states in valid states error message.
>  -- Fix scheduling last array task multiple times on error, causing segfault.
>  -- Fix issue where a step could be allocated more memory than the job when
>     dealing with --mem-per-cpu and --threads-per-core.
>  -- Fix removing qos from assoc with -= can lead to assoc with no qos
>  -- auth/jwt - fix segfault on invalid credential in slurmdbd due to
>     missing validate_slurm_user() function in context.
>  -- Fix single Port= not being applied to range of nodes in slurm.conf
>  -- Fix Jobs not requesting a tres are not starting because of that tres limit.
>  -- acct_gather_energy/rapl - fix AveWatts calculation.
>  -- job_container/tmpfs - Fix issues with cleanup and slurmd restarting on
>     running jobs.

More information about the slurm-users mailing list