[slurm-users] sacct returns nothing after reboot

Marcus Boden mboden at gwdg.de
Wed May 13 13:58:18 UTC 2020


Hi,

the default time window starts at 00:00:00 of the current day:
-S, --starttime
          Select jobs in any state after the specified  time.  Default
          is  00:00:00  of  the  current  day, unless the '-s' or '-j'
          options are used. If the  '-s'  option  is  used,  then  the
          default  is  'now'. If states are given with the '-s' option
          then only jobs in this state at this time will be  returned.
          If  the  '-j'  option is used, then the default time is Unix
          Epoch 0. See the DEFAULT TIME WINDOW for more details.

Best,
Marcus


On 5/12/20 2:08 PM, Roger Mason wrote:
> Hello,
> 
> Yesterday I instituted job accounting via mysql on my (FreeBSD 11.3)
> test cluster.  The cluster consists of a machine running
> slurmctld+slurmdbd and two running slurmd (slurm version 20.02.1).
> After experiencing a slurmdbd core dump when using mysql-5.7.30
> (reported on this list on May 5) I installed 5.7.28 instead.
> 
> Before yesterday I had no accounting of any kind.  I had observed the
> behaviour that the job id's always restarted at 2 after a reboot.  After
> installing mysql and setting it up I ran a few test jobs and verified
> that sacct listed them: all seemed well.
> 
> This morning upon re-booting the machine running slurmctld+slurmdbd
> sacct returns nothing:
> 
> rmason sacct --allusers
>        JobID    JobName  Partition    Account  AllocCPUS      State ExitCode
> ------------ ---------- ---------- ---------- ---------- ---------- --------
> 
> so it seems that yesterday's jobs have been forgotten.
> 
> When I connect to mysql as the user owning the databases it seems there
> is information present.  For example,
> 
> select * from imacbeastie_job_table;
> 
> returns information about the test jobs I ran yesterday.
> 
> As a further test I just ran another test job:
> 
> squeue
> JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
>  2      imac        test   rmason  R       0:03      1 patchperthite
> 
> I notice that the jobid starts at 2 (I ran 5 or 6 test jobs yesterday).
> 
> sacct now returns information:
> sacct --allusers
>        JobID    JobName  Partition    Account  AllocCPUS      State ExitCode
> ------------ ---------- ---------- ---------- ---------- ---------- --------
> 2                  test       imac                     2  COMPLETED      0:0
> 2.batch           batch                                2  COMPLETED      0:0
> 2.0            hostname                                1  COMPLETED      0:0
> 2.1               sleep                                1  COMPLETED      0:0
> 
> but only for the test job I ran today.
> 
> I appreciate any help in getting accounting to work properly.
> 
> Thanks,
> Roger
> 

-- 
Marcus Vincent Boden, M.Sc.
Arbeitsgruppe eScience
Tel.:   +49 (0)551 201-2191
E-Mail: mboden at gwdg.de
---------------------------------------
Gesellschaft fuer wissenschaftliche
Datenverarbeitung mbH Goettingen (GWDG)
Am Fassberg 11, 37077 Goettingen
URL:    http://www.gwdg.de
E-Mail: gwdg at gwdg.de
Tel.:   +49 (0)551 201-1510
Fax:    +49 (0)551 201-2150
Geschaeftsfuehrer: Prof. Dr. Ramin Yahyapour
Aufsichtsratsvorsitzender:
Prof. Dr. Christian Griesinger
Sitz der Gesellschaft: Goettingen
Registergericht: Goettingen
Handelsregister-Nr. B 598
---------------------------------------

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 5376 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.schedmd.com/pipermail/slurm-users/attachments/20200513/809354ff/attachment-0001.bin>


More information about the slurm-users mailing list