[slurm-users] [EXT]Re: only 1 job running

Brian Andrus toomuchit at gmail.com
Thu Jan 28 20:59:25 UTC 2021


You are getting close :)
You can see why n010 is able to run multiple jobs: it reports more 
available resources (CPUs=256 with ThreadsPerCore=2, versus CPUs=255 
with ThreadsPerCore=1 on n[011-013]).
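
To make the difference concrete, here is a minimal sketch that compares the two quoted `slurmd -C` results field by field. It assumes the result lines are pasted in as strings with the stderr messages stripped out. The helper names (`parse_node`, `diff_nodes`, `topology_cpus`) are made up for illustration and are not part of Slurm.

```python
import re

# Result lines copied from the quoted `slurmd -C` runs.
# Assumption: the interleaved "slurmd: error: ..." messages on n011 are dropped.
N011 = ("NodeName=n011 CPUs=255 Boards=1 SocketsPerBoard=8 CoresPerSocket=16 "
        "ThreadsPerCore=1 RealMemory=1031878")
N010 = ("NodeName=n010 CPUs=256 Boards=1 SocketsPerBoard=2 CoresPerSocket=64 "
        "ThreadsPerCore=2 RealMemory=1031887")


def parse_node(line):
    """Turn 'Key=Value Key=Value ...' into a dict, converting digit strings to int."""
    fields = dict(re.findall(r"(\w+)=(\S+)", line))
    return {k: int(v) if v.isdigit() else v for k, v in fields.items()}


def diff_nodes(a, b):
    """Return {key: (a_value, b_value)} for every key whose values differ."""
    return {k: (a.get(k), b.get(k))
            for k in sorted(set(a) | set(b))
            if k != "NodeName" and a.get(k) != b.get(k)}


def topology_cpus(node):
    """Boards * sockets * cores * threads, i.e. the CPU count the topology implies."""
    return (node["Boards"] * node["SocketsPerBoard"]
            * node["CoresPerSocket"] * node["ThreadsPerCore"])


n011 = parse_node(N011)
n010 = parse_node(N010)

# Print only the fields that differ between the two nodes.
for key, (v11, v10) in diff_nodes(n011, n010).items():
    print(f"{key}: n011={v11} n010={v10}")

# Compare the reported CPU count with the count implied by the topology.
for node in (n011, n010):
    print(node["NodeName"], "CPUs:", node["CPUs"],
          "topology product:", topology_cpus(node))
```

On n010 the reported CPUs match the topology product (256), while on n011 they do not (255 reported versus 128 implied), which lines up with the "Thread count (255) not multiple of core count (128)" error in the quoted output.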

What specific resources does a job request: nodes, cores, memory, 
threads, etc.?

Brian Andrus

On 1/28/2021 12:52 PM, Chandler wrote:
> OK I'm getting this same output on nodes n[011-013]:
>
> # slurmd -C
> NodeName=n011 slurmd: error: FastSchedule will be removed in 20.02, as 
> will the FastSchedule=0 functionality. Please consider removing this 
> from your configuration now.
> slurmd: Considering each NUMA node as a socket
> slurmd: error: Thread count (255) not multiple of core count (128)
> CPUs=255 Boards=1 SocketsPerBoard=8 CoresPerSocket=16 ThreadsPerCore=1 
> RealMemory=1031878
> UpTime=86-20:59:54
> #
>
> but on n010 it looks like:
>
> # slurmd -C
> NodeName=n010 CPUs=256 Boards=1 SocketsPerBoard=2 CoresPerSocket=64 
> ThreadsPerCore=2 RealMemory=1031887
> UpTime=20-00:01:31
> #
>


