[slurm-users] Extreme long db upgrade 16.05.6 -> 17.11.3

Hendryk Bockelmann bockelmann at dkrz.de
Thu Feb 22 03:12:15 MST 2018


Hi Chris,

we were faced with exactly the same problem - update of 16.05.11 to 
17.11.3 took more than 24 hours without finalizing the conversion of job 
table. Finally, we cancelled the process, went back to "old" version 
16.05.11 and restored the database. At that time we had 10.5 million 
jobs in db with a total db size of 11 GB.

Our "solution":
* Activate purge/archive in slurmdbd.conf keeping just 1months of data. 
This reduced the db to 250000 jobs and 600 MB (needed roughly 2 hours 
for initial purge)
* Update to slurm 17.11.3 with reduced db; it took just 15 minutes!

Best,
Hendryk

-- 
Dr. Hendryk Bockelmann
Wissenschaftliches Rechnen
Abteilung Anwendungen

Deutsches Klimarechenzentrum GmbH (DKRZ)
Bundesstraße 45 a, D-20146 Hamburg, Germany

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 4973 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20180222/18adab68/attachment.bin>


More information about the slurm-users mailing list