[slurm-users] Extreme long db upgrade 16.05.6 -> 17.11.3
Ole Holm Nielsen
Ole.H.Nielsen at fysik.dtu.dk
Wed Jul 18 03:45:30 MDT 2018
On 07/18/2018 10:56 AM, Roshan Thomas Mathew wrote:
> We ran into this issue trying to move from 16.05.3 -> 17.11.7 with 1.5M
> records in job table.
>
> In our first attempt, MySQL reported "ERROR 1206 The total number of
> locks exceeds the lock table size" after about 7 hours.
>
> Increased InnoDB Buffer Pool size -
> https://dba.stackexchange.com/questions/27328/how-large-should-be-mysql-innodb-buffer-pool-size
> - to 12G (the machine hosting mysql has 128GB) and restarted the
> conversion and which then completed successfully in 6.5 hours.
>
> I am sure there are other MySQL tweaks that can be applied catered
> towards SLURM, will be useful if we can pool them together into the
> documentation.
I think this is a needle-in-haystack documentation problem :-)
The MySQL optimization has already been documented in
https://slurm.schedmd.com/accounting.html
I've summarized the information in my Wiki page:
https://wiki.fysik.dtu.dk/niflheim/Slurm_database#mysql-configuration
/Ole
More information about the slurm-users
mailing list