[slurm-users] Header lengths are longer than data received after changing SelectType & GresTypes to use MPS

Robert Kudyba rkudyba at fordham.edu
Tue Apr 7 20:25:32 UTC 2020


Using Slurm 20.02 on CentIOS 7.7 with Bright Cluster. We changed the
following options to enable MPS:
SelectType=select/cons_tres
GresTypes=gpu,mic,mps

I restarted slurmctld and ran scontrol reconfigure, however all jobs get
the below error:
[2020-04-07T15:29:00.741] debug:  backfill: no jobs to backfill
[2020-04-07T15:29:03.051] Resending TERMINATE_JOB request JobId=3056
Nodelist=node[001-002]
[2020-04-07T15:29:03.051] Resending TERMINATE_JOB request JobId=3061
Nodelist=node003
[2020-04-07T15:29:03.051] debug:  sched: Running job scheduler
[2020-04-07T15:29:03.063] agent/is_node_resp: node:node003
RPC:REQUEST_TERMINATE_JOB : Header lengths are longer than data received
[2020-04-07T15:29:03.071] agent/is_node_resp: node:node002
RPC:REQUEST_TERMINATE_JOB : Header lengths are longer than data received
[2020-04-07T15:29:03.071] agent/is_node_resp: node:node001
RPC:REQUEST_TERMINATE_JOB : Header lengths are longer than data received

Do any other options need changing? What causes these header length errors?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200407/08348bb0/attachment.htm>


More information about the slurm-users mailing list