Integrate with JobSet
This page shows how to use Alauda Build of Kueue to manage JobSet workloads. A JobSet allows you to define a group of related Kubernetes Jobs that are managed together as a single unit.
JobSet is useful for distributed workloads that consist of multiple job components, such as a driver job and multiple worker jobs, where all components must run together.
Prerequisites
- You have installed the Alauda Build of Kueue.
- You have installed the JobSet controller.
- The
JobSetframework is enabled in the Kueue configuration. - The Alauda Container Platform Web CLI has communication with your cluster.
- You have created a
ClusterQueue,ResourceFlavor, andLocalQueue.
Procedure
-
Create a
JobSetresource with thekueue.x-k8s.io/queue-namelabel:kueue.x-k8s.io/queue-name: Specifies the LocalQueue that manages this JobSet. Kueue admits all replicated jobs together as a single unit.workers: A replicated job that runs 4 parallel worker pods, each requesting 1 GPU.driver: A single driver pod that coordinates the workers.
-
Apply the JobSet:
-
Monitor the JobSet admission:
-
Check the JobSet status: