For example if a job has to use different clusters with Slurm I am forced to launch it with two sbatch commands:
sbatch -M cluster1 job1
sbatch -m cluster2 job2
This way I get two different jobids. Using sacct I have not found a way to know that the two jobs were launched within the same workflow.
I was hoping not to have to add other components such as Nextflow.
From: David <drhey@umich.edu>
Sent: Thursday, August 29, 2024 2:53 PM
To: Di Bernardini, Fabio <dfabio@amazon.com>
Cc: slurm-users@lists.schedmd.com
Subject: RE: [EXTERNAL] [slurm-users] Best practices for tracking jobs started across multiple clusters for accounting purposes.
CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless
you can confirm the sender and know the content is safe. |
Hello,
What is meant here by "tracking"? What information are you looking to gather and track?
I'd say the simplest answer is using sacct, but I am not sure how federated/non-federated setups come into play while using it.
David
On Tue, Aug 27, 2024 at 6:23 AM Di Bernardini, Fabio via slurm-users <slurm-users@lists.schedmd.com> wrote:
I need to account for jobs composed of multiple jobs launched on multiple federated (and non-federated) clusters, which therefore have different job IDs. What are the best practices to prevent users from bypassing this tracking?
NICE SRL, viale Monte Grappa 3/5, 20124 Milano, Italia, Registro delle Imprese di Milano Monza Brianza Lodi REA n. 2096882, Capitale Sociale: 10.329,14 EUR i.v., Cod. Fisc. e P.IVA 01133050052, Societa con Socio Unico
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-leave@lists.schedmd.com
--
David Rhey
---------------
Advanced Research Computing
University of Michigan