[slurm-users] "sacctmgr add cluster" crashing slurmdbd

Marcus Wagner wagner at itc.rwth-aachen.de
Wed May 6 13:44:02 UTC 2020


Hi, same here :/

the segfault happens after the procedure call in mysql:

call get_parent_limits('assoc_table', 'rwth0515', 'rcc', 0); select 
@par_id, @mj, @mja, @mpt, @msj, @mwpj, @mtpj, @mtpn, @mtmpj, @mtrm, 
@def_qos_id, @qos, @delta_qos;

The mysql answer is:

+---------+------+------+------+------+-------+-------+-------+--------+-------+-------------+------+------------+
| @par_id | @mj  | @mja | @mpt | @msj | @mwpj | @mtpj | @mtpn | @mtmpj | 
@mtrm | @def_qos_id | @qos | @delta_qos |
+---------+------+------+------+------+-------+-------+-------+--------+-------+-------------+------+------------+
|    5312 |  800 | NULL | NULL | 1000 |  1440 | 1=8   | NULL  | NULL   | 
NULL  |        NULL | ,1,  | NULL       |
+---------+------+------+------+------+-------+-------+-------+--------+-------+-------------+------+------------+

the segfault happens in as_mysql_assoc.c:

#0  0x00002ae3dea6c05a in _cluster_get_assocs 
(mysql_conn=mysql_conn at entry=0x2ae3f4000d70, 
user=user at entry=0x2ae3e1feca90, 
assoc_cond=assoc_cond at entry=0x2ae3f40009f0, cluster_name=0x63f110 "rcc",
     fields=<optimized out>, sent_extra=<optimized out>, 
is_admin=is_admin at entry=true, sent_list=sent_list at entry=0x6dc030) at 
as_mysql_assoc.c:2141
2141                                    if (row2[ASSOC2_REQ_MTPN][0])

hope that helps anyone.

Best
Marcus

Am 06.05.2020 um 09:49 schrieb Ben Polman:
> On 06-05-2020 07:38, Chris Samuel wrote:
> 
> We are experiencing exactly the same problem after mysql upgrade to 5.7.30,
> moving database to old mysql server running 5.6 solves the problem.
> Most likely downgrading mysql to 5.7.29 will work as well
> 
> I have no clue which change in mysql-server is causing this
> 
> best regards,
> Ben
> 
>> On Tuesday, 5 May 2020 3:21:45 PM PDT Dustin Lang wrote:
>>
>>> Since this happens on a fresh new database, I just don't understand how I
>>> can get back to a basic functional state.  This is exceedingly frustrating.
>> I have to say that if you're seeing this with 17.11, 18.08 and 19.05 and this
>> only started when your colleague upgraded MySQL then this sounds like MySQL is
>> triggering this problem.
>>
>> We're running with MariaDB 10.x (from SLES15) without issues (our database is
>> huge).
>>
>> All the best,
>> Chris
> 
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 5326 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200506/faea71f1/attachment.bin>


More information about the slurm-users mailing list