
Introduce AEP with Provisioning Request CRD #5848

Merged
merged 1 commit into kubernetes:master on Sep 11, 2023

Conversation

kisieland
Contributor

What type of PR is this?

/kind feature

What this PR does / why we need it:

This PR adds a new AEP that proposes a new CRD for the Cluster Autoscaler.
This new API will allow users to express that a group of pods should be provisioned together.

Does this PR introduce a user-facing change?

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

N/A

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. area/cluster-autoscaler labels Jun 13, 2023
@qianlei90
Contributor

@mwielgus mwielgus self-assigned this Jun 14, 2023
@mwielgus
Contributor

@qianlei90 Yes, it should help with the issue.

cluster-autoscaler/proposals/provisioning-request.md (excerpt under review):
…created by the CA.
5. Once all of the pods are scheduled users can delete the ProvReq object, otherwise it will be garbage collected after some time.
6. When pods finish the work and nodes become unused the CA will scale them…

What are the rules for scale-down when nodes are provisioned but the pods didn't start (or haven't started yet)?

Contributor

It would be great if this was somehow configurable in the longer run.

Contributor Author

The current proposal is to use the same logic for scale-down, so it would be configurable via the --scale-down-unneeded-time flag.
For now there is no way for the CA to know which VMs come from the atomic scale-up, so we cannot differentiate.


cluster-autoscaler/proposals/provisioning-request.md (excerpt under review):
// ProvisioningClass describes the different modes of provisioning the resources.
// Supported values:
// * GENERIC_CHECK_CAPACITY - check if the current cluster state can fulfill this request

Is there a strong need for GENERIC_CHECK_CAPACITY? There is no scaling in that case, so it seems somewhat outside of "autoscaler" responsibilities. Could we maybe start with just GENERIC_ATOMIC?

Also, what happens if there are two CHECK_CAPACITY requests but there is capacity for only one? Does one get capacity and the other one fail? When does the hold expire, and is it consistent with how ProvisioningRequest scale-down works for GENERIC_ATOMIC_SCALE_UP?

Contributor

I think that there may be quite a few use cases. For example, trying to run low-priority jobs only when there is capacity on the statically allocated nodes. Currently K8s doesn't have gang scheduling, and this may work as a best-effort replacement.
The problem with invalidation exists in all cases where shared/non-exclusive resources are provisioned. This should be addressed in the doc. Possible options are:

  • Invalidate a request that is still there.
  • Treat the PR as a point-in-time query and don't bother about invalidation.

Given that we already have 2 possible behaviors for a single capacity, the params/classes may actually be needed from day 1.

cluster-autoscaler/proposals/provisioning-request.md (outdated review thread, resolved)
@kisieland kisieland force-pushed the aep-prov-req branch 2 times, most recently from 311bd7e to bfddb12 Compare June 19, 2023 15:54
@severinson

This capability is interesting for the Armada project, where we'd like to control resource provisioning separately from pod scheduling. Some context: with Armada we typically have long queues of pods that can't currently be scheduled, but we may not want to provision enough capacity to run those pods immediately.

@asm582
Member

asm582 commented Jun 23, 2023

Can other external controllers instead of CA use provisioning CRD to allocate resources?

@kisieland
Contributor Author

@asm582

Can other external controllers instead of CA use provisioning CRD to allocate resources?

CA is meant to be the only component reacting to the CRD. That said, if it is not installed, there is nothing blocking another component from stepping in to fill the void (e.g. the Karpenter component).
Note: One potential way we can have different components reacting to CRDs in the same cluster is to split them via ProvisioningClass.

As for interacting with the CRD, any other component can create ProvisioningRequest objects to ask the CA to provision the VMs.
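A minimal sketch of what "another component creates a ProvisioningRequest object" could look like in Go, using the client-go dynamic client. The group/version/resource coordinates, field names, and class value below are illustrative assumptions for this discussion, not the API defined by the AEP.

package provreqclient

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/client-go/dynamic"
	"k8s.io/client-go/rest"
)

// createProvReq asks whichever component watches ProvisioningRequests
// (normally the CA) to provision capacity, by creating a CR through the
// dynamic client.
func createProvReq(ctx context.Context, cfg *rest.Config) error {
	client, err := dynamic.NewForConfig(cfg)
	if err != nil {
		return err
	}

	// Assumed group/version/resource; the exact coordinates are not fixed by
	// the AEP text quoted here.
	gvr := schema.GroupVersionResource{
		Group:    "autoscaling.x-k8s.io",
		Version:  "v1beta1",
		Resource: "provisioningrequests",
	}

	provReq := &unstructured.Unstructured{Object: map[string]interface{}{
		"apiVersion": gvr.Group + "/" + gvr.Version,
		"kind":       "ProvisioningRequest",
		"metadata":   map[string]interface{}{"name": "batch-job-capacity"},
		"spec": map[string]interface{}{
			// Illustrative class value; see the ProvisioningClass discussion above.
			"provisioningClass": "GENERIC_ATOMIC_SCALE_UP",
		},
	}}

	_, err = client.Resource(gvr).Namespace("default").Create(ctx, provReq, metav1.CreateOptions{})
	return err
}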

@asm582
Member

asm582 commented Jun 26, 2023

Thanks @kisieland, can you please explain the below statement a bit more?

Note: One potential way we can have different components reacting to CRDs in the same cluster is to split them via ProvisioningClass.

@kisieland
Contributor Author

@asm582

We can have ProvisioningClasses GenericAtomicScaleUp and CloudProviderSomeOtherProvisioningMode.
CRs with the first class would be handled by the CA, and CRs with the second would be handled by some other component.
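A rough sketch of how such routing by ProvisioningClass could look, using the GENERIC_* values from the API excerpt above as the CA-owned classes. The type and field names are assumptions for illustration, not the AEP's final Go API.

package provreq

// ProvisioningRequest is a minimal stand-in for the CRD's Go type; the real
// type would carry more fields (pod sets, conditions, etc.).
type ProvisioningRequest struct {
	Name              string
	ProvisioningClass string
}

// classesHandledByCA lists the classes the Cluster Autoscaler would act on.
// Requests with any other class are left untouched, so a cloud-provider
// specific controller (or e.g. Karpenter) can pick them up instead.
var classesHandledByCA = map[string]bool{
	"GENERIC_ATOMIC_SCALE_UP": true,
	"GENERIC_CHECK_CAPACITY":  true,
}

// shouldReconcile reports whether this controller owns the given request.
func shouldReconcile(pr ProvisioningRequest) bool {
	return classesHandledByCA[pr.ProvisioningClass]
}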

@kisieland
Contributor Author

/label api-review

@k8s-ci-robot k8s-ci-robot added the api-review Categorizes an issue or PR as actively needing an API review. label Jun 26, 2023
Member

@alculquicondor left a comment


I forgot to send my review last week :(

cluster-autoscaler/proposals/provisioning-request.md (3 outdated review threads, resolved)
@kisieland
Contributor Author

Thanks @alculquicondor, addressed your comments.

@jonathan-innis left a comment


From reading through this, this strikes me as a static capacity request for "ghost pods", i.e. I want negative-preemption pods to represent my capacity without those pods actually existing and being interacted with by the kube-scheduler. Is there a similar hierarchical relationship that we can build here, similar to how we have varying levels of abstraction for Pod/ReplicaSet/Deployment? Is there a way to represent a capacity request as a single pod and then build off of it with a replica count? Then CAS recognizes either real pods or these static-capacity ghost pods.

There's also a question in my head about the difference in use-cases here. One use-case is that I always want overhead capacity for a warm pool at all times; the other is that I eventually want that static capacity to be filled and I don't want to further scale up my capacity when that pre-provisioned static capacity is utilized.

@kisieland
Contributor Author

@liggitt PTAL

@mwielgus
Contributor

mwielgus commented Sep 7, 2023

@liggitt @jpbetz Do you have any more comments? If not (or there is no response) I will be merging the PR soon.

@kisieland
Contributor Author

kisieland commented Sep 8, 2023

Re @jpbetz:

... Is there a way to represent a capacity request as a single pod and then build off of it with a replica count?

Provisioning Requests can represent one pod. But building more of those objects and expecting CAS to group them defeats the purpose of the ProvReq object in the first place: why use such ProvReqs when one can just use pods?
The reason ProvReq aims to represent groups of Pods is to avoid the CA making decisions based on a partial state of the cluster.
If we used labels to group pods, CAS would not know when all of the pods have been created and might start work that is not needed.

One use-case is I always want overhead capacity for a warm pool at all times; the other is I eventually want that static capacity to be filled and I don't want to further scale up my capacity when that pre-provisioned static capacity is utilized.

Here we propose to implement the second mode, but there is nothing preventing you or anybody else from adding a constant-size buffer mode; all the required fields are here, only the CAS logic would need to be slightly modified.
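To illustrate the grouping point made earlier in this comment, here is a minimal sketch of how a spec could express a whole group of pods at once. The field names are assumptions for illustration only, not the exact shape proposed in the AEP.

package provreq

// PodSet describes one homogeneous group of pods and how many replicas of
// that shape the request needs capacity for.
type PodSet struct {
	// PodTemplateName references a PodTemplate holding the pod spec that the
	// capacity should be sized for.
	PodTemplateName string
	// Count is the number of pods of this shape covered by the request.
	Count int32
}

// ProvisioningRequestSpec carries the whole group up front, so the CA never
// has to guess whether more pods matching some label are still on their way.
type ProvisioningRequestSpec struct {
	ProvisioningClass string
	PodSets           []PodSet
}

// Example: capacity for 50 identical workers, requested atomically; the CA
// either provisions room for all 50 or reports failure, instead of acting on
// a partial view of the group.
var exampleSpec = ProvisioningRequestSpec{
	ProvisioningClass: "GENERIC_ATOMIC_SCALE_UP",
	PodSets: []PodSet{
		{PodTemplateName: "worker-template", Count: 50},
	},
}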

Contributor

@mwielgus left a comment


/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 11, 2023
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kisieland, mwielgus

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 11, 2023
@k8s-ci-robot k8s-ci-robot merged commit 4e34992 into kubernetes:master Sep 11, 2023
Member

@kerthcet left a comment


Hi, I'm not quite familiar with the autoscaler source code, but I do care about this feature, so I left some comments; they may not be mature, forgive me. 🥲

@kisieland kisieland deleted the aep-prov-req branch September 12, 2023 11:28
@kisieland kisieland mentioned this pull request Sep 13, 2023
@liggitt
Member

liggitt commented Sep 14, 2023

@liggitt @jpbetz Do you have any more comments? If not (or there is no response) I will be merging the PR soon.

Sorry for the delay, we have a bi-weekly API review that will be reviewing the updates today.

I see a CRD was merged already in #6104 linking to this PR as API approval.
Lazy consensus does not constitute API approval for k8s.io APIs. Please move the CRD to the x-k8s.io domain if you want to proceed without API approval, or revert the PR until approval is given.

@mwielgus
Contributor

The API was meant to go to x-k8s.io from the very beginning. Thanks for noticing it. It will be fixed soon.

@kisieland
Contributor Author

Thanks @liggitt for noticing, created #6108 to address this!

Labels
api-review Categorizes an issue or PR as actively needing an API review. approved Indicates a PR has been approved by an approver from all required OWNERS files. area/cluster-autoscaler cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.