Please note: this EKS and Karpenter workshop version is now deprecated since the launch of the Karpenter v1beta1 APIs, and has been updated to a new home on AWS Workshop Studio here: Karpenter: Amazon EKS Best Practice and Cloud Cost Optimization.
This workshop remains here as a reference for those who have used it before, or for those who want to reference it for running Karpenter on the v1alpha5 API.
So far we have seen some advanced use cases of Karpenter. In this section we will see how Karpenter can use multiple Provisioners, which allows different configurations to be handled by different Provisioners.
Each Provisioner CRD (Custom Resource Definition) provides a unique set of configurations that define the resources it supports, as well as the labels and taints that will be applied to the new nodes created by that Provisioner. In large clusters with multiple applications, some applications may need nodes with specific taints or specific labels. In these scenarios you can configure alternative Provisioners. For this workshop we have already defined a team1 Provisioner. You can list the available Provisioners by running the following command:
kubectl get provisioners
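The exact output depends on what you have created so far, but it should include the team1 Provisioner (alongside the default Provisioner used in earlier sections of this workshop), for example:
NAME      AGE
default   ...
team1     ...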
Let's now create a new deployment, and this time force it to use the team1 Provisioner.
cat <<EOF > inflate-team1.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: inflate-team1
spec:
  replicas: 0
  selector:
    matchLabels:
      app: inflate-team1
  template:
    metadata:
      labels:
        app: inflate-team1
    spec:
      nodeSelector:
        intent: apps
        kubernetes.io/arch: amd64
        karpenter.sh/provisioner-name: team1
      containers:
      - image: public.ecr.aws/eks-distro/kubernetes/pause:3.2
        name: inflate-team1
        resources:
          requests:
            cpu: "1"
            memory: 256M
      tolerations:
      - key: team1
        operator: Exists
      topologySpreadConstraints:
      - labelSelector:
          matchLabels:
            app: inflate-team1
        maxSkew: 1
        topologyKey: topology.kubernetes.io/zone
        whenUnsatisfiable: ScheduleAnyway
EOF
kubectl apply -f inflate-team1.yaml
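At this point the deployment exists but, with replicas set to 0, no pods are running yet. You can confirm that with:
kubectl get deployment inflate-team1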
There seem to be quite a few new entries in this section! Let’s cover a few of those. The rest we will discuss in the Challenge section.
The deployment sets the nodeSelector intent: apps so that this application does not overlap with the Managed Node Group created with the cluster.
The nodeSelector karpenter.sh/provisioner-name: team1 is also set. Karpenter uses this selector to decide which Provisioner configuration should be used when provisioning capacity for this deployment. This allows different Provisioners to, for example, define different taint sections.
The nodeSelector kubernetes.io/arch has been set to amd64 so that x86_64 instances are used.
If you recall, the team1 Provisioner had a section that defined a team1 taint. The deployment must also add a matching toleration so that its pods are allowed onto the newly created instances carrying the team1: NoSchedule taint. Note that NoSchedule means only pods that tolerate the team1 taint will be allowed on those nodes.
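For reference, here is a minimal sketch of what such a team1 Provisioner could look like on the v1alpha5 API. The taint matches what is described above; the intent: apps label, the on-demand requirement, the Bottlerocket provider settings, and the discovery tags are assumptions for illustration and may differ from the Provisioner actually deployed in this workshop:
apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: team1
spec:
  labels:
    intent: apps                     # assumed: lets the deployment's intent: apps nodeSelector match
  taints:
    - key: team1
      effect: NoSchedule             # matches the toleration added to the deployment
  requirements:
    - key: karpenter.sh/capacity-type
      operator: In
      values: ["on-demand"]          # assumed: the logs later show on-demand capacity being launched
  provider:
    amiFamily: Bottlerocket          # the team1 Provisioner uses Bottlerocket, see the launch template discussion below
    subnetSelector:
      karpenter.sh/discovery: eksworkshop-eksctl      # assumed discovery tag
    securityGroupSelector:
      karpenter.sh/discovery: eksworkshop-eksctl      # assumed discovery tag
  ttlSecondsAfterEmpty: 30           # assumed: removes empty nodes shortly after scale-down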
You can use Kube-ops-view or plain kubectl CLI to visualize the changes and answer the questions below. In the answers we will provide the CLI commands that will help you check the responses. Remember: to get the URL of kube-ops-view you can run the following command: kubectl get svc kube-ops-view | tail -n 1 | awk '{ print "Kube-ops-view URL = http://"$4 }'
Answer the following questions. You can expand each question to get a detailed answer and validate your understanding.
How would you scale the inflate-team1 deployment to 4 replicas?
To scale up the deployment run the command:
kubectl scale deployment inflate-team1 --replicas 4
We should be able to check the number of replicas deployed by running the command below and looking for the inflate-team1 pods. There should be 4 of them.
kubectl get pods
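If you also want to see which node each pod landed on, you can filter by the app label and use the wide output format:
kubectl get pods -l app=inflate-team1 -o wide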
Throughout this workshop we have used a few mechanisms to inspect nodes. We can run the following command to check the nodes that were created and their labels.
kubectl get node --selector=intent=apps,karpenter.sh/provisioner-name=team1 --show-labels
This should display output similar to the one that follows. From the output below, in our example we can see that the instances selected were on-demand and of type t3a.xlarge.
NAME STATUS ROLES AGE VERSION LABELS
xxxxxxxxxxxxxxxxxxxxxxxxxxxx.compute.internal Ready <none> 66s v1.21.5-eks-bc4871b beta.kubernetes.io/arch=amd64,beta.kubernetes.io/instance-type=t3a.xlarge,beta.kubernetes.io/os=linux,failure-domain.beta.kubernetes.io/region=eu-west-1,failure-domain.beta.kubernetes.io/zone=eu-west-1b,intent=apps,karpenter.sh/provisioner-name=team1,kubernetes.io/arch=amd64,kubernetes.io/hostname=xxxxxxxxxxxxxxxxxxxxxxxxxxxx.compute.internal,kubernetes.io/os=linux,kubernetes.sh/capacity-type=on-demand,node.kubernetes.io/instance-type=t3a.xlarge,topology.kubernetes.io/region=eu-west-1,topology.kubernetes.io/zone=eu-west-1b
xxxxxxxxxxxxxxxxxxxxxxxxxxx.compute.internal Ready <none> 66s v1.21.5-eks-bc4871b beta.kubernetes.io/arch=amd64,beta.kubernetes.io/instance-type=t3a.xlarge,beta.kubernetes.io/os=linux,failure-domain.beta.kubernetes.io/region=eu-west-1,failure-domain.beta.kubernetes.io/zone=eu-west-1a,intent=apps,karpenter.sh/provisioner-name=team1,kubernetes.io/arch=amd64,kubernetes.io/hostname=xxxxxxxxxxxxxxxxxxxxxxxxxxx.compute.internal,kubernetes.io/os=linux,kubernetes.sh/capacity-type=on-demand,node.kubernetes.io/instance-type=t3a.xlarge,topology.kubernetes.io/region=eu-west-1,topology.kubernetes.io/zone=eu-west-1a
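If you prefer a more compact view, you can print only the relevant labels as columns; the label keys below are the same ones that appear in the output above:
kubectl get nodes -l karpenter.sh/provisioner-name=team1 -L node.kubernetes.io/instance-type,topology.kubernetes.io/zone,kubernetes.sh/capacity-type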
But there is something that does not match what we have seen so far with Karpenter. In previous scenarios Karpenter bin-packed the workload into as few instances as possible. In this case two instances have been created, one in each AZ (in this case eu-west-1a and eu-west-1b)!
Well, let's first check the Karpenter logs.
alias kl='kubectl -n karpenter logs -l app.kubernetes.io/name=karpenter --all-containers=true -f --tail=20'
kl
The Karpenter log output should look similar to the one below:
...
2022-09-05T11:11:33.993Z DEBUG controller.provisioning 27 out of 509 instance types were excluded because they would breach provisioner limits {"commit": "b157d45"}
2022-09-05T11:11:33.993Z DEBUG controller.provisioning 27 out of 509 instance types were excluded because they would breach provisioner limits {"commit": "b157d45"}
2022-09-05T11:11:33.999Z DEBUG controller.provisioning 27 out of 509 instance types were excluded because they would breach provisioner limits {"commit": "b157d45"}
2022-09-05T11:11:33.999Z DEBUG controller.provisioning 381 out of 509 instance types were excluded because they would breach provisioner limits {"commit": "b157d45"}
2022-09-05T11:11:34.006Z INFO controller.provisioning Found 4 provisionable pod(s) {"commit": "b157d45"}
2022-09-05T11:11:34.006Z INFO controller.provisioning Computed 2 new node(s) will fit 4 pod(s) {"commit": "b157d45"}
2022-09-05T11:11:34.007Z INFO controller.provisioning Launching node with 2 pods requesting {"cpu":"2125m","memory":"512M","pods":"4"} from types t3a.xlarge, c6a.xlarge, c5a.xlarge, t3.xlarge, c6i.xlarge and 35 other(s) {"commit": "b157d45", "provisioner": "team1"}
2022-09-05T11:11:34.014Z INFO controller.provisioning Launching node with 2 pods requesting {"cpu":"2125m","memory":"512M","pods":"4"} from types t3a.xlarge, c6a.xlarge, c5a.xlarge, t3.xlarge, c6i.xlarge and 325 other(s) {"commit": "b157d45", "provisioner": "team1"}
2022-09-05T11:11:34.342Z DEBUG controller.provisioning.cloudprovider Discovered launch template Karpenter-eksworkshop-eksctl-14752700009555043417 {"commit": "b157d45", "provisioner": "team1"}
2022-09-05T11:11:36.601Z DEBUG controller.provisioning.cloudprovider InsufficientInstanceCapacity for offering { instanceType: t3a.xlarge, zone: eu-west-1b, capacityType: on-demand }, avoiding for 3m0s {"commit": "b157d45", "provisioner": "team1"}
2022-09-05T11:11:36.748Z INFO controller.provisioning.cloudprovider Launched instance: i-0b44228e7195f7588, hostname: ip-192-168-42-207.eu-west-1.compute.internal, type: c6a.xlarge, zone: eu-west-1b, capacityType: on-demand {"commit": "b157d45", "provisioner": "team1"}
2022-09-05T11:11:38.400Z INFO controller.provisioning.cloudprovider Launched instance: i-0e5173a4f48019515, hostname: ip-192-168-31-229.eu-west-1.compute.internal, type: t3a.xlarge, zone: eu-west-1a, capacityType: on-demand {"commit": "b157d45", "provisioner": "team1"}
...
It’s interesting to see how the nodes were selected in different Availability Zones.
As you are probably guessing by now, this has to do with the topologySpreadConstraints section of the deployment. Recent versions of Karpenter (>0.4.1) support Kubernetes Topology Spread Constraints. You can define one or more topologySpreadConstraints to instruct the kube-scheduler how to place each incoming Pod in relation to the existing Pods across your cluster.
To all effects the bin-packing is still happening; it's just that the topologySpreadConstraint is applied and forces the workload to be spread across the available topology.kubernetes.io/zone values.
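One way to confirm the spread is to list which node (and therefore which zone) each inflate-team1 pod was scheduled on, for example with custom columns:
kubectl get pods -l app=inflate-team1 -o custom-columns=POD:.metadata.name,NODE:.spec.nodeName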
Check out the details in the log as well. If you recall, the node template in the provider section of the team1 Provisioner used Bottlerocket with additional custom bootstrapping. You can see in the logs how Karpenter creates the Karpenter-eksworkshop-eksctl-641081096202606695 launch template, adapting it to the latest version of the AMI and the bootstrapping added to the configuration. This significantly simplifies the life-cycle management and patching of EC2 instances.
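If you want to inspect the launch templates that Karpenter manages on your behalf, a command along the following lines should list them; the Karpenter-* name prefix matches the one shown in the logs above:
aws ec2 describe-launch-templates --filters "Name=launch-template-name,Values=Karpenter-*" --query "LaunchTemplates[].LaunchTemplateName"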
Finally, how do you scale the inflate-team1 deployment back down to 0 replicas? This one should be really easy. To scale down the deployment run the command:
kubectl scale deployment inflate-team1 --replicas 0
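Assuming the team1 Provisioner sets ttlSecondsAfterEmpty (as in the sketch earlier in this section), the now-empty nodes should be terminated shortly after the pods are gone. You can watch them disappear with:
kubectl get nodes -l karpenter.sh/provisioner-name=team1 --watch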
In this section we have learned:
Applications requiring specific labels or Taints can make use of alternative Provisioners that are customized for that set of applications. This is a common setup for large clusters.
Pods can select the Provisioner by setting a nodeSelector with the label karpenter.sh/provisioner-name pointing to the right Provisioner.
Karpenter supports topologySpreadConstraints. Topology Spread Constraints instruct the kube-scheduler how to place each incoming Pod in relation to the existing Pods across your cluster. In this scenario we used them to balance pods across Availability Zones.
For the AL2, Ubuntu, and Bottlerocket AMI families, Karpenter does the heavy lifting of managing the underlying launch templates and keeping AMIs up to date. Karpenter also allows us to configure extra bootstrapping parameters without having to manage launch templates ourselves. This significantly simplifies the life-cycle management and patching of EC2 instances while removing the heavy lifting required to apply additional bootstrapping parameters.