As you say, it does reduce overall throughput, because those large-gpu nodes are dedicated. We don’t have many nodes in that partition, the rest are in our standard queues. Ultimately,
this is a tradeoff. We have chosen to reduce total throughput slightly in order to make sure that large jobs actually get scheduled in a timely fashion.
This is the boulders-and-sand scheduling problem; I don’t think there’s any single perfect configuration, it just depends on what’s most important to your business; throughput, or
the turnaround time of particular jobs.
Tim
--
Tim Cutts
Senior Director, R&D IT - Data, Analytics & AI, Scientific Computing Platform
AstraZeneca
Find out more about R&D IT Data, Analytics & AI and how we can support you by visiting our Service
Catalogue |
AstraZeneca UK Limited is a company incorporated in England and Wales with registered number:03674842 and its registered office at 1 Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge, CB2 0AA.
This e-mail and its attachments are intended for the above named recipient only and may contain confidential and privileged information. If they have come to you in error, you must not copy or show them to anyone; instead, please reply to this e-mail, highlighting the error to the sender and then immediately delete the message. For information about how AstraZeneca UK Limited and its affiliates may process information, personal data and monitor communications, please see our privacy notice at www.astrazeneca.com