[slurm-users] seff Not Caluculating [FIXED?]

Diego Zuccato diego.zuccato at unibo.it
Thu Nov 19 07:22:56 UTC 2020


Il 18/11/20 15:15, Jason Simms ha scritto:

> Use of uninitialized value $hash{"2"} in division (/) at /bin/seff line
> 108, <DATA> line 602.
> Use of uninitialized value $hash{"2"} in division (/) at /bin/seff line
> 108, <DATA> line 602.
Seems some setups report data in a different format, hence the
uninitialized value. In my case, there's no $step->{'stats'}{'rss_max'}
and I've had to use $step->{'stats'}{'tres_usage_in_max'} . Maybe your
install uses the former?
I'm using Debian stable (Buster) w/ default packages (currently
18.08.5.2-1+deb10u1).

What I get:
-8<--
$ /home/software/utils/seff 9604
Job ID: 9604
Cluster: oph
User/Group: name.surname/domain^users
State: COMPLETED (exit code 0)
Cores: 1
CPU Utilized: 00:01:40
CPU Efficiency: 98.04% of 00:01:42 core-walltime
Memory Utilized: 203.71 MB
Memory Efficiency: 1.46% of 13.67 GB

$ sacct -a --format
JobID,User,Group,State,Cluster,AllocCPUS,REQMEM,TotalCPU,Elapsed,MaxRSS,ExitCode,NNodes,NTasks
-j 9604
       JobID      User     Group      State    Cluster  AllocCPUS
ReqMem   TotalCPU    Elapsed     MaxRSS ExitCode   NNodes   NTasks
------------ --------- --------- ---------- ---------- ----------
---------- ---------- ---------- ---------- -------- -------- --------
9604         name.sur+ domain^u+  COMPLETED        oph          1
13993Mn  01:39.703   00:01:42                 0:0        1
9604.batch                        COMPLETED        oph          1
13993Mn  01:39.703   00:01:42    208604K      0:0        1        1
-8<--

You can try enabling Dumper uncommenting lines 11 and 60. My result:
-8<--
$ bin/seff.debug 9604
$VAR1 = {
          'eligible' => 1605621478,
          'timelimit' => 300,
          'derived_ec' => 0,
          'resvid' => 0,
          'user' => 'name.surname',
          'nodes' => 'str957-bl0-17',
          'uid' => 0,
          'account' => 'astro',
          'sys_cpu_usec' => 0,
          'jobname' => 'job-blade-serial.sh',
          'show_full' => 1,
          'start' => 1605621479,
          'user_cpu_sec' => 0,
          'priority' => 1,
          'req_cpus' => 1,
          'tot_cpu_sec' => 0,
          'end' => 1605621581,
          'qosid' => 1,
          'suspended' => 0,
          'state' => 3,
          'array_max_tasks' => 0,
          'exitcode' => 0,
          'wckeyid' => 0,
          'tres_alloc_str' =>
'1=1,2=13993,3=18446744073709551614,4=1,5=150',
          'wckey' => '',
          'elapsed' => 102,
          'sys_cpu_sec' => 0,
          'lft' => 250,
          'requid' => 4294967295,
          'req_mem' => 13993,
          'submit' => 1605621478,
          'track_steps' => 1,
          'partition' => 'b5',
          'cluster' => 'oph',
          'array_job_id' => 0,
          'user_cpu_usec' => 0,
          'jobid' => 9604,
          'stats' => {
                       'consumed_energy' => 0,
                       'act_cpufreq' => '0'
                     },
          'alloc_gres' => '',
          'associd' => 20,
          'array_task_id' => 4294967294,
          'req_gres' => '',
          'alloc_nodes' => 1,
          'gid' => 2125988353,
          'tot_cpu_usec' => 0,
          'steps' => [
                       {
                         'user_cpu_sec' => 99,
                         'tot_cpu_sec' => 99,
                         'stats' => {
                                      'tres_usage_in_ave' =>
'1=88010,2=213610496,3=0,6=95527596,7=514859008,8=0',
                                      'tres_usage_in_min_nodeid' =>
'1=0,2=0,3=0,6=0,7=0,8=0',
                                      'tres_usage_out_tot' =>
'3=0,6=231676',
                                      'tres_usage_in_min_taskid' =>
'1=0,2=0,6=0,7=0,8=0',
                                      'tres_usage_out_max' =>
'3=0,6=231676',
                                      'tres_usage_in_tot' =>
'1=88010,2=213610496,3=0,6=95527596,7=514859008,8=0',
                                      'tres_usage_in_min' =>
'1=88010,2=213610496,3=0,6=95527596,7=514859008,8=0',
                                      'consumed_energy' => 0,
                                      'tres_usage_out_max_nodeid' =>
'3=0,6=0',
                                      'tres_usage_out_max_taskid' => '6=0',
                                      'act_cpufreq' => '8755',
                                      'tres_usage_in_max_taskid' =>
'1=0,2=0,6=0,7=0,8=0',
                                      'tres_usage_out_ave' =>
'3=0,6=231676',
                                      'tres_usage_out_min' =>
'3=0,6=231676',
                                      'tres_usage_in_max' =>
'1=88010,2=213610496,3=0,6=95527596,7=514859008,8=0',
                                      'tres_usage_out_min_taskid' => '6=0',
                                      'tres_usage_in_max_nodeid' =>
'1=0,2=0,3=0,6=0,7=0,8=0',
                                      'tres_usage_out_min_nodeid' =>
'3=0,6=0'
                                    },
                         'nnodes' => 1,
                         'end' => 1605621581,
                         'stepid' => 4294967294,
                         'suspended' => 0,
                         'state' => 3,
                         'tres_alloc_str' => '1=1,2=13993,4=1',
                         'exitcode' => 0,
                         'tot_cpu_usec' => 703732,
                         'stepname' => 'batch',
                         'requid' => 4294967295,
                         'req_cpufreq_gov' => 0,
                         'elapsed' => 102,
                         'sys_cpu_sec' => 0,
                         'task_dist' => 0,
                         'req_cpufreq_min' => 0,
                         'sys_cpu_usec' => 587602,
                         'ntasks' => 1,
                         'nodes' => 'str957-bl0-17',
                         'start' => 1605621479,
                         'user_cpu_usec' => 116130,
                         'req_cpufreq_max' => 0
                       }
                     ]
        };
Job ID: 9604
Cluster: oph
User/Group: name.surname/domain^users
State: COMPLETED (exit code 0)
Cores: 1
CPU Utilized: 00:01:40
CPU Efficiency: 98.04% of 00:01:42 core-walltime
Memory Utilized: 203.71 MB
Memory Efficiency: 1.46% of 13.67 GB
-8<--

-- 
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786



More information about the slurm-users mailing list