Dear all,
I am working on a script that takes completed-job accounting data from the Slurm accounting database and inserts the equivalent data into a ClickHouse table for fast reporting.
I can see that all the information is included in the cluster_job_table and cluster_job_step_table, which seem to be joined on job_db_inx.
To get the CPU usage, peak memory usage, etc., I can see that I need to parse the tres columns in the job steps. I couldn't find any column called MaxRSS in the database, even though the sacct command prints it. I then found some data in tres_table and assume that sacct is using this. Please correct me if I'm wrong and sacct is getting this information from somewhere other than the accounting database.
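A minimal sketch of the parsing step I have in mind, assuming the tres columns hold comma-separated id=value pairs (for example "1=4,2=8192") whose ids are resolved through tres_table. The function name parse_tres and the sample string are hypothetical, not taken from Slurm.

```python
from typing import Dict


def parse_tres(tres: str) -> Dict[int, int]:
    """Parse a TRES string such as '1=4,2=8192' into {tres_id: value}.

    Assumption: the column is a comma-separated list of integer id=value
    pairs; an empty string yields an empty dict.
    """
    result: Dict[int, int] = {}
    for pair in tres.split(","):
        pair = pair.strip()
        if not pair:
            continue  # skip empty fragments, e.g. from an empty column
        key, _, value = pair.partition("=")
        result[int(key)] = int(value)
    return result


# Hypothetical sample value, not real accounting data.
usage = parse_tres("1=4,2=8192")
```

The resulting ids could then be looked up in tres_table to decide which value ends up in which ClickHouse column.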
For the state column I get this:
select state, count(*) as num from crg_step_table group by state order by num desc limit 10;
+-------+--------+
| state | num    |
+-------+--------+
|     3 | 590635 |
|     5 |  28345 |
|     4 |   4401 |
|    11 |    962 |
|     1 |      8 |
+-------+--------+
When I use sacct I see statuses such as COMPLETED, OUT_OF_MEMORY, etc., so there must be a mapping somewhere between these state ids and that text. Can someone provide that mapping or point me to where it's defined in the database or in the code?
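To make the question concrete, this is the kind of lookup I would expect to write. The ids and names below are an assumption on my part, based on the job_states enum in slurm/slurm.h and the strings sacct prints; I have not confirmed them against the database. Masking with 0xff (JOB_STATE_BASE) is likewise an assumption, meant to drop any flag bits stored above the base state. The helper name state_name is hypothetical.

```python
# Assumed mapping from the job_states enum in slurm/slurm.h (unverified
# against the accounting database).
JOB_STATE_BASE = 0xFF  # assumed mask separating base state from flag bits

STATE_NAMES = {
    0: "PENDING",
    1: "RUNNING",
    2: "SUSPENDED",
    3: "COMPLETED",
    4: "CANCELLED",
    5: "FAILED",
    6: "TIMEOUT",
    7: "NODE_FAIL",
    8: "PREEMPTED",
    9: "BOOT_FAIL",
    10: "DEADLINE",
    11: "OUT_OF_MEMORY",
}


def state_name(state: int) -> str:
    """Return the assumed sacct-style name for a numeric state id."""
    base = state & JOB_STATE_BASE
    return STATE_NAMES.get(base, f"UNKNOWN({state})")


# The ids seen in the query output above.
names = [state_name(s) for s in (3, 5, 4, 11, 1)]
```

If this assumption holds, the five ids in my query output would be COMPLETED, FAILED, CANCELLED, OUT_OF_MEMORY and RUNNING respectively.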
Many thanks,
Emyr James
Head of Scientific IT
CRG - Centre for Genomic Regulation