[slurm-users] Unable to submit job (ReqNodeNotAvail, UnavailableNodes)
JP Ebejer
jean.p.ebejer at um.edu.mt
Tue Nov 7 10:15:42 UTC 2023
Hi there Diego,
Grazie per il vostro aiuto.
I had to use sudo to switch to the slurm user, as with myuser I got
"slurm_update error: Invalid user id".
$ sudo -u slurm scontrol update nodename=compute-0 state=resume
This works (I think, as it returns no visual cue), but on running sinfo
right after, the node is still "drained".
$ sinfo --Node --long
Tue Nov 07 10:08:27 2023
NODELIST NODES PARTITION STATE CPUS S:C:T MEMORY TMP_DISK
WEIGHT AVAIL_FE REASON
compute-0 1 all_nodes* drained 32 2:8:2 60000 0
1 (null) batch job complete f
In my jobs (squeue) I now also have the failed jobs
$ squeue --long -u $USER
JOBID PARTITION NAME USER ST TIME NODES
NODELIST(REASON)
9 all_nodes hello_wo myuser PD 0:00 1
(ReqNodeNotAvail, UnavailableNodes:compute-0)
11 all_nodes hello_wo myuser PD 0:00 1
(ReqNodeNotAvail, UnavailableNodes:compute-0)
What am I missing here please?
On Tue, 7 Nov 2023 at 10:36, Diego Zuccato <diego.zuccato at unibo.it> wrote:
> Il 07/11/2023 10:12, JP Ebejer ha scritto:
>
> > sinfo shows that the node is drained (but this node is idle and has no
> > processing)
> >
> > $ sinfo --Node --long
> > Tue Nov 07 08:29:51 2023
> > NODELIST NODES PARTITION STATE CPUS S:C:T MEMORY TMP_DISK
> > WEIGHT AVAIL_FE REASON
> > compute-0 1 all_nodes* drained 32 2:8:2 60000 0
> > 1 (null) batch job complete f
> You have to RESUME the node so it starts accepting jobs.
> scontrol update nodename=compute-0 state=resume
>
> --
> Diego Zuccato
> DIFA - Dip. di Fisica e Astronomia
> Servizi Informatici
> Alma Mater Studiorum - Università di Bologna
> V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
> tel.: +39 051 20 95786
>
>
--
<https://www.um.edu.mt/>
Prof. Jean-Paul Ebejer | Associate Professor
BSc (Hons) (Melita), MSc (Imperial), DPhil (Oxon.)
*Centre for Molecular Medicine and Biobanking*
Office 320, Biomedical Sciences Building,
University of Malta, Msida, MSD 2080. MALTA.
T: (00356) 2340 3263
*Department of Artificial Intelligence*
Associate Member
Join the *Bioinformatics at UM*
<https://groups.google.com/a/um.edu.mt/g/mailinglist-bioinformatics.research>
mailing
list!
*Where to find me* <https://bitsilla.com/blog/where-to-find-me/>
[image: https://twitter.com/dr_jpe] <https://twitter.com/dr_jpe> [image:
https://bitsilla.com/blog/] <https://bitsilla.com/blog/> [image:
https://github.com/jp-um] <https://github.com/jp-um>
--
*The contents of this email are subject to *these terms
<https://www.um.edu.mt/disclaimer/email/>.**
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20231107/b26a12b5/attachment.htm>
More information about the slurm-users
mailing list