[slurm-users] sacct returns nothing after reboot
Marcus Boden
mboden at gwdg.de
Wed May 13 13:58:18 UTC 2020
Hi,
the default time window starts at 00:00:00 of the current day:
-S, --starttime
Select jobs in any state after the specified time. Default
is 00:00:00 of the current day, unless the '-s' or '-j'
options are used. If the '-s' option is used, then the
default is 'now'. If states are given with the '-s' option
then only jobs in this state at this time will be returned.
If the '-j' option is used, then the default time is Unix
Epoch 0. See the DEFAULT TIME WINDOW for more details.
Best,
Marcus
On 5/12/20 2:08 PM, Roger Mason wrote:
> Hello,
>
> Yesterday I instituted job accounting via mysql on my (FreeBSD 11.3)
> test cluster. The cluster consists of a machine running
> slurmctld+slurmdbd and two running slurmd (slurm version 20.02.1).
> After experiencing a slurmdbd core dump when using mysql-5.7.30
> (reported on this list on May 5) I installed 5.7.28 instead.
>
> Before yesterday I had no accounting of any kind. I had observed the
> behaviour that the job id's always restarted at 2 after a reboot. After
> installing mysql and setting it up I ran a few test jobs and verified
> that sacct listed them: all seemed well.
>
> This morning upon re-booting the machine running slurmctld+slurmdbd
> sacct returns nothing:
>
> rmason sacct --allusers
> JobID JobName Partition Account AllocCPUS State ExitCode
> ------------ ---------- ---------- ---------- ---------- ---------- --------
>
> so it seems that yesterday's jobs have been forgotten.
>
> When I connect to mysql as the user owning the databases it seems there
> is information present. For example,
>
> select * from imacbeastie_job_table;
>
> returns information about the test jobs I ran yesterday.
>
> As a further test I just ran another test job:
>
> squeue
> JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON)
> 2 imac test rmason R 0:03 1 patchperthite
>
> I notice that the jobid starts at 2 (I ran 5 or 6 test jobs yesterday).
>
> sacct now returns information:
> sacct --allusers
> JobID JobName Partition Account AllocCPUS State ExitCode
> ------------ ---------- ---------- ---------- ---------- ---------- --------
> 2 test imac 2 COMPLETED 0:0
> 2.batch batch 2 COMPLETED 0:0
> 2.0 hostname 1 COMPLETED 0:0
> 2.1 sleep 1 COMPLETED 0:0
>
> but only for the test job I ran today.
>
> I appreciate any help in getting accounting to work properly.
>
> Thanks,
> Roger
>
--
Marcus Vincent Boden, M.Sc.
Arbeitsgruppe eScience
Tel.: +49 (0)551 201-2191
E-Mail: mboden at gwdg.de
---------------------------------------
Gesellschaft fuer wissenschaftliche
Datenverarbeitung mbH Goettingen (GWDG)
Am Fassberg 11, 37077 Goettingen
URL: http://www.gwdg.de
E-Mail: gwdg at gwdg.de
Tel.: +49 (0)551 201-1510
Fax: +49 (0)551 201-2150
Geschaeftsfuehrer: Prof. Dr. Ramin Yahyapour
Aufsichtsratsvorsitzender:
Prof. Dr. Christian Griesinger
Sitz der Gesellschaft: Goettingen
Registergericht: Goettingen
Handelsregister-Nr. B 598
---------------------------------------
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 5376 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200513/809354ff/attachment-0001.bin>
More information about the slurm-users
mailing list