
Intro carpenter #2425

Open · wants to merge 8 commits into main
Conversation

pipo02mix
Contributor

What this PR does / why we need it

Towards https://github.com/giantswarm/giantswarm/issues/29503

Things to check/remember before submitting

  • If it's one of your first contributions, make sure you've read the Contributing Guidelines.
  • Bump last_review_date in the front matter header of the pages you've touched.

@pipo02mix pipo02mix requested a review from a team as a code owner December 13, 2024 15:04
@pipo02mix pipo02mix marked this pull request as draft December 13, 2024 15:04
@pipo02mix pipo02mix self-assigned this Dec 13, 2024
Contributor

This PR moves/renames or deletes some files. Please make sure to

  • maintain references (also important for images)
  • maintain aliases in the front matter of moved markdown files

Contributor

github-actions bot commented Dec 17, 2024

Hugo yielded some warnings. Please check whether they require action.

WARN  Template shortcodes/autoscaling_supported_versions.html is unused, source file /home/runner/work/docs/docs/src/layouts/shortcodes/autoscaling_supported_versions.html

@pipo02mix pipo02mix marked this pull request as ready for review December 17, 2024 15:36
To avoid collisions between the two, the cluster autoscaler is configured with a lower priority than `Karpenter`, so it reacts only after a pod has been in `Pending` state for a while (default: 5 minutes).
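For reference, a minimal sketch of how such a delay could be expressed, assuming it is driven by cluster-autoscaler's `--new-pod-scale-up-delay` flag passed through the Helm chart's `extraArgs` (the actual Giant Swarm configuration is not shown in this PR and may differ):

```yaml
# Hypothetical cluster-autoscaler Helm values sketch (assumption, not the actual Giant Swarm config)
extraArgs:
  # Ignore pods younger than 5 minutes, giving Karpenter the first chance to react
  new-pod-scale-up-delay: 5m
```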

## Configuration

Contributor Author

@T-Kukawka does it come installed in CAPA WC by default?

Contributor

It doesn't come pre-installed. Therefore, we should clarify at the top of the article that it's a custom addition at the moment, while cluster-autoscaler is built-in and works fine out of the box.

@pipo02mix pipo02mix requested review from a team and marians December 17, 2024 15:37

At Giant Swarm, your workload clusters run with [cluster autoscaler](https://github.com/kubernetes/autoscaler) and [`Karpenter`](https://karpenter.sh/) to reach optimal scaling for your workloads and keep costs to a minimum. This tutorial guides you through the configuration and management of both.

The cluster autoscaler is responsible for scaling the number of nodes on the different node pools of your workload cluster. It's triggered by not schedule pods, pods in `Pending` state, making the controller increase the number of desired nodes in the node pool. Indeed it modifies the `AutoScalingGroup` to reflect the new desired capacity.
Contributor

Suggested change
The cluster autoscaler is responsible for scaling the number of nodes on the different node pools of your workload cluster. It's triggered by not schedule pods, pods in `Pending` state, making the controller increase the number of desired nodes in the node pool. Indeed it modifies the `AutoScalingGroup` to reflect the new desired capacity.
The cluster autoscaler is responsible for scaling the number of nodes on the different node pools of your workload cluster. It's triggered by not scheduled pods, pods in `Pending` state, making the controller increase the number of desired nodes in the node pool. Indeed it modifies the `AutoScalingGroup` to reflect the new desired capacity.



Instead, `Karpenter` relies on Kubernetes events to scale the number of nodes in the cluster up or down. It selects from a suite of instance types defined in a special `Provisioner` resource to match the workload requirements, and can be configured to use spot instances to save costs. It's faster and more efficient than the cluster autoscaler, but does not operate well with base on-demand instances.
Contributor

Why do you say it doesn't operate well with base on-demand instances?


Our recommendation for the autoscaling configuration is to set up two different profiles: one targeting `Spot` compute and the other `On-Demand` instances. The `Spot` profile gets a higher weight so it's prioritized over the `On-Demand` profile, while the `On-Demand` profile ensures the cluster has a base capacity to handle the main workloads. A sketch of both profiles follows the parameter list below.

First, let's dive in what is a `Provisioner` custom resource to understand how to configure it. There are a set of parameters to help you define how the nodes should be provisioned:
Contributor

Suggested change
First, let's dive in what is a `Provisioner` custom resource to understand how to configure it. There are a set of parameters to help you define how the nodes should be provisioned:
First, let's dive into what a `Provisioner` custom resource is to understand how to configure it. There are a set of parameters to help you define how the nodes should be provisioned:


- **labels**: Used to select which nodes should be managed by the provisioner.
- **limits**: Define the resources limits for the nodes.
Contributor

Suggested change
- **limits**: Define the resources limits for the nodes.
- **limits**: Lets you set limits on the total CPU and Memory that can be used by the node pool, effectively stopping further node provisioning when those limits have been reached.
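To make the two-profile recommendation concrete, here is a minimal sketch, assuming Karpenter's `v1alpha5` `Provisioner` API and an `AWSNodeTemplate` named `default`; the profile names, labels and limits are illustrative, not the actual Giant Swarm defaults:

```yaml
apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: spot
spec:
  weight: 10                          # higher weight: preferred over the on-demand profile
  labels:
    example.io/profile: spot          # illustrative label applied to nodes from this profile
  requirements:
    - key: karpenter.sh/capacity-type
      operator: In
      values: ["spot"]
  limits:
    resources:
      cpu: "100"                      # stop provisioning once this pool reaches 100 vCPU
  providerRef:
    name: default                     # assumed AWSNodeTemplate
---
apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: on-demand
spec:
  weight: 1                           # lower weight: provides the base capacity
  labels:
    example.io/profile: on-demand
  requirements:
    - key: karpenter.sh/capacity-type
      operator: In
      values: ["on-demand"]
  limits:
    resources:
      cpu: "50"
  providerRef:
    name: default
```

Because the `spot` provisioner carries the higher weight, Karpenter considers it first, while the `on-demand` provisioner covers the workloads that should not run on spot capacity.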
