[slurm-users] slurm 17.11.2: Socket timed out on send/recv operation

Alessandro Federico a.federico at cineca.it
Fri Jan 12 03:32:57 MST 2018


Hi all, 


We are setting up SLURM 17.11.2 on a small test cluster of about 100 nodes.
Sometimes we get the error in the subject when running any SLURM command (e.g. sinfo, squeue, scontrol reconf, etc.).
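
One way to put a number on "sometimes" is to run a command repeatedly and count how many runs print the error. The following is only a minimal sketch under stated assumptions: the helper name count_socket_timeouts, the default of 20 attempts, and the one-second pause are all hypothetical choices, and the match is simply on the error text from the subject line.

```python
import subprocess
import time

# Error text taken from the subject line of this message.
ERROR_TEXT = "Socket timed out on send/recv operation"


def count_socket_timeouts(cmd, attempts=20, pause=1.0):
    """Run `cmd` `attempts` times and return how many runs printed ERROR_TEXT.

    `cmd` is an argument list such as ["sinfo"]; `attempts` and `pause`
    are illustrative defaults, not recommended values.
    """
    failures = 0
    for i in range(attempts):
        # Capture both streams, since the message may go to stderr or stdout.
        result = subprocess.run(cmd, capture_output=True, text=True)
        if ERROR_TEXT in result.stderr or ERROR_TEXT in result.stdout:
            failures += 1
        # Pause between runs, but not after the last one.
        if i < attempts - 1:
            time.sleep(pause)
    return failures
```

For example, count_socket_timeouts(["sinfo"]) would return how many of 20 sinfo runs hit the timeout, which could then be compared across commands or times of day.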


Is there a particular setting we need to apply to avoid this problem?


We found this bug report, https://bugs.schedmd.com/show_bug.cgi?id=4002, but it concerns the previous SLURM version,
and we do not set debug3 on slurmctld.


Thanks in advance,
ale 

-- 

Alessandro Federico 
HPC System Management Group 
System & Technology Department 
CINECA www.cineca.it 
Via dei Tizii 6, 00185 Rome - Italy 
phone: +39 06 44486708 

All work and no play makes Jack a dull boy. 