Kueue is a set of APIs and a controller for job queueing. It is a job-level manager that decides when a job should be admitted to start (that is, when its pods can be created) and when it should stop (that is, when its active pods should be deleted).
Read the overview and watch the Kueue-related talks & presentations to learn more.
Kueue supports job queueing based on priorities with different strategies: StrictFIFO and BestEffortFIFO.
Based on community feedback, we continue to simplify and evolve the API to address new use cases.
Requires Kubernetes 1.29 or newer.
To install the latest release of Kueue in your cluster, run the following command:
kubectl apply --server-side -f https://github.com/kubernetes-sigs/kueue/releases/download/v0.18.2/manifests.yaml
The controller runs in the kueue-system namespace.
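The install step above can be wrapped in a small script that pins the release and waits for the controller before continuing. This is a minimal sketch, not the project's official installer: the `KUEUE_VERSION` and `MANIFEST_URL` variables and the `install_kueue` function are hypothetical names introduced here, and the deployment name `kueue-controller-manager` is an assumption that should be checked against the installation guide for your release.

```shell
#!/bin/sh
# Release to install; the value mirrors the command shown above.
KUEUE_VERSION="v0.18.2"
MANIFEST_URL="https://github.com/kubernetes-sigs/kueue/releases/download/${KUEUE_VERSION}/manifests.yaml"

# Hypothetical helper: apply the release manifests, then block until the
# controller deployment reports the Available condition.
install_kueue() {
  kubectl apply --server-side -f "${MANIFEST_URL}" || return 1
  # Assumption: the controller deployment is named kueue-controller-manager.
  kubectl wait deploy/kueue-controller-manager -n kueue-system \
    --for=condition=available --timeout=5m
}
```

The script only defines the helper; calling `install_kueue` against a cluster runs both steps. Keeping the version in one variable makes it easier to upgrade or roll back to a specific release.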
Read the installation guide to learn more.
A minimal configuration can be set by running the examples:
kubectl apply -f examples/admin/single-clusterqueue-setup.yaml
Then you can run a job with:
kubectl create -f examples/jobs/sample-job.yaml
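To give a sense of what such a setup and job might contain, the sketch below writes two illustrative manifests to local files. It is not a copy of the repository's example files: the file names, object names, quotas, and container image are assumptions chosen for illustration, and the `kueue.x-k8s.io/v1beta1` API version may differ in your release. The Job is created suspended and labelled with a queue name, so that Kueue can decide when to admit it.

```shell
#!/bin/sh
# Illustrative queue setup: one flavor, one ClusterQueue, one LocalQueue.
# Names and quotas are assumptions, not the project's shipped example.
cat > kueue-minimal-setup.yaml <<'EOF'
apiVersion: kueue.x-k8s.io/v1beta1
kind: ResourceFlavor
metadata:
  name: default-flavor
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: ClusterQueue
metadata:
  name: cluster-queue
spec:
  namespaceSelector: {}   # accept workloads from all namespaces
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
    - name: default-flavor
      resources:
      - name: cpu
        nominalQuota: 9
      - name: memory
        nominalQuota: 36Gi
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: LocalQueue
metadata:
  namespace: default
  name: user-queue
spec:
  clusterQueue: cluster-queue
EOF

# Illustrative Job: starts suspended and points at the LocalQueue above.
cat > kueue-sample-job.yaml <<'EOF'
apiVersion: batch/v1
kind: Job
metadata:
  generateName: sample-job-
  namespace: default
  labels:
    kueue.x-k8s.io/queue-name: user-queue
spec:
  parallelism: 3
  completions: 3
  suspend: true
  template:
    spec:
      restartPolicy: Never
      containers:
      - name: sleeper
        image: busybox
        command: ["sh", "-c", "sleep 60"]
        resources:
          requests:
            cpu: "1"
            memory: 200Mi
EOF

# With a cluster available, the files could then be applied:
#   kubectl apply -f kueue-minimal-setup.yaml
#   kubectl create -f kueue-sample-job.yaml
```

`kubectl create` is used for the Job because the manifest relies on `generateName`, which `kubectl apply` does not support.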
Learn more about:
High-level overview of the main priorities for 2026:
- Improve user experience for MultiKueue - multi-cluster Job dispatching, in particular:
  * Support Elastic RayJob #8712
  * Workload-Level Admission Constraints and Preference-Aware MultiKueue Dispatching #8729
  * Prevent starting preemptions in multiple worker clusters #8303
  * Support long running services #8526
  * Log retrieval from worker clusters #3526
- Improve user experience for Topology Aware Scheduling, in particular:
  * Support for ResourceTransformations #8860
  * Support for Elastic Workloads #8160
  * Evict workloads which are running on nodes which become tainted #8838
- Integration with the k8s native Workload-Aware Scheduler (WAS) and Topology-Aware Scheduling #8871
- Support for Concurrent Workload Admission #8691
- Support for running hero workloads #8826
- Consider preemption cost when finding preemption candidates #7990
- Progress towards Beta for the integration with Dynamic Resource Allocation (DRA) #8243
Long-term aspirational goals:
- Partial preemption of serving workloads #3762
- Integration with workflow frameworks #74
- Budget support #28
- Flavor assignment strategies, e.g. minimizing cost vs minimizing borrowing #312
- Cooperative preemption support for workloads that implement checkpointing #477
- Delayed preemption for two-stage admission #3758
- Support Structured Parameters (DRA) in Kueue #2941
- Graduate the API to v1 #3476
Learn how to engage with the Kubernetes community on the community page and the contributor's guide.
You can reach the maintainers of this project at:
Participation in the Kubernetes community is governed by the Kubernetes Code of Conduct.