Apply and manage Pod rightsizing with CLI

This section describes how to apply and manage Pod rightsizing from the cluster using the CLI.

Pod rightsizing is configured declaratively using Kubernetes Policy custom resources (CRs). A policy defines Pod rightsizing behavior and how the policy selects workloads.

Policies select workloads using an appList selector, label selectors, or a policy annotation on the workload.

Each workload is managed by at most one policy at a time. When multiple policies match the same workload, precedence rules determine which policy manages it.

If no policy manages a workload, Pod rightsizing is not applied to it.

Workflow description

Policies are stored on the cluster as Kubernetes custom resources.

Use the following workflow to apply Pod rightsizing from the cluster:

Select a policy (built-in or custom).
Define workload selection (by using appList or appSelector label selectors, or a workload policy annotation).
Update the Policy resource in the cluster so the updated selectors and parameters take effect.

No Policy resources?
If your cluster does not show Policy resources, rather APC and JPC resources, you are running an older version of the Insights agent.
See Deprecated Pod rightsizing topics.

Kompass provides built-in policies, and you can also create custom policies.

The built-in policies - Balanced, Stability-focused, and Cost-focused - are described in Pod rightsizing workload optimization. These policies manage the rightsizing of both the CPU and memory workload resources.

You can see the built-in policy fixed parameters at Built-in policy fixed parameter values.

You cannot delete or change built-in policies parameters.

To manage a workload by policy, you edit the policy workload selection and priority values, as described in Apply and manage Pod rightsizing for workloads - CLI.

To have full control of policy parameters, you can create a custom policy. With custom policies you can also take advantage of the following:

Temporarily increase resources during Pod initialization.
For more information, see Boost startup resources.
Select to rightsize only CPU or only memory, not both.
For more information, see Selective Pod rightsizing.

For information about creating, editing, and deleting a custom policy, see Manage custom policies.

Configuration sample

Values inside [ ] are user-specified placeholders.

spec:
  appList: []
  appSelector:
    labels: {}
    namespaces:
      operator: Include
      values: []
  cooldownPeriod: 0s
  cpu:
    bufferPercentage: 30
    ignoreThrottlingEvent: false
    minimumImpactPercentage: 10
    minimumStepForScaleUp: 100m
    minimumValueForScaleDown: 10m
    optimizationStrategy: max
    replicaAggregationMethod: max
  dataCoveragePercentage: 80
  evictPodsInDefferedState: false
  lookbackPeriod: 5m
  memory:
    bufferPercentage: 40
    ignoreOomEvent: false
    minimumImpactPercentage: 10
    minimumStepForScaleUp: 256Mi
    minimumValueForScaleDown: 256Mi
    optimizationStrategy: max
    replicaAggregationMethod: max
  startupBoost:
 	duration: 5m
 	cpu:
   	  multiplier: 1.5
 	memory:
   	  multiplier: 3
  preferInPlace: true
  priority: 2

Policy parameters reference

All parameters are validated when a policy is created or updated. Policies with missing or invalid parameters cannot be applied.

Click here to see the policy parameter reference

Parameter	Description	Value
`spec.appList`	Explicit allowlist of workloads to manage with this policy.	List of workloads
`spec.appSelector.labels`	Label-based rules for selecting workloads to manage.	Label selector
`spec.appSelector.namespaces.operator`	Specifies whether to include or exclude the namespaces listed in `spec.appSelector.namespaces.values`.	`Include` \| Exclude
`spec.appSelector.namespaces.values`	Namespaces used by the namespace filter.	List of strings
`spec.cooldownPeriod`	Minimum time between consecutive Pod rightsizing changes. Longer periods promote greater stability.	Duration (for example, 2h, 30m, 0s)
`spec.cpu.bufferPercentage`	Additional CPU buffer to add to the recommended CPU request.	Percentage (integer 0–100)
`spec.cpu.ignoreThrottlingEvent`	Specifies whether to ignore CPU throttling rollback behavior.	`true` \| `false`
`spec.cpu.minimumImpactPercentage`	Minimum percentage difference required for Kompass to apply a CPU request change. Higher values increase stability (but may reduce optimization effectiveness).	Percentage (integer 0–100)
`spec.cpu.minimumStepForScaleUp`	Minimum CPU increment for scale-up changes.	CPU quantity (for example, `100m`)
`spec.cpu.minimumValueForScaleDown`	Minimum CPU request allowed for scale-down changes.	CPU quantity (for example, `10m`)
`spec.cpu.optimizationStrategy`	Specifies which CPU optimization strategy to apply.	`p95` \| `max` \| `p85`
`spec.cpu.replicaAggregationMethod`	Specifies how Pod replicas are aggregated when calculating CPU recommendations. You can configure custom values. For more information, see Define custom replica aggregation method values.	`p95` \| `max` \| `p85` \| `custom value 1`\| `custom value 2`
`spec.dataCoveragePercentage`	Minimum portion of the lookback period required before Kompass generates Pod rightsizing recommendations.	Percentage (integer 0–100)
`spec.evictPodsInDeferredState`	If a Pod need to be created on a new node, evict it so it can be rescheduled on a new node.	`true` \| `false`
`spec.lookbackPeriod`	Amount of historical workload data that Kompass uses to generate recommendations.	Duration (for example, `1d`, `12h`, `5m`)
`spec.memory.bufferPercentage`	Additional memory buffer to add to the recommended memory request.	Percentage (integer 0–100)
`spec.memory.ignoreOomEvent`	Specifies whether to ignore out-of-memory (OOM) rollback behavior.	`true` \| `false`
`spec.memory.minimumImpactPercentage`	Minimum percentage difference required for Kompass to apply a memory request change. Higher values increase stability (but may reduce optimization effectiveness).	Percentage (integer 0–100)
`spec.memory.minimumStepForScaleUp`	Minimum memory increment for scale-up changes.	Memory quantity (for example, `256Mi`)
`spec.memory.minimumValueForScaleDown`	Minimum memory request allowed for scale-down changes.	Memory quantity (for example, `256Mi`)
`spec.memory.optimizationStrategy`	Specifies which memory optimization strategy to apply.	`p99` \| `max` \| `p95`
`spec.memory.replicaAggregationMethod`	Specifies how Pod replicas are aggregated when calculating memory recommendations. You can configure custom values. For more information, see Define custom replica aggregation method values.	`p99` \| `max` \| `p95`\| `custom value 1`\| `custom value 2`
`spec.startupBoost.duration`	Duration for which the startup boost is applied after Pod creation.	Duration (for example, `2m`, `5m`, `10m`)
`spec.startupBoost.cpu.multiplier`	Multiplier applied to the CPU request during startup.	Decimal (for example, `1.5`)
`spec.startupBoost.memory.multiplier`	Multiplier applied to the memory request during startup.	Decimal (for example, `3`)
`spec.preferInPlace`	If in-place changes are supported, adjust the resource request using in-place changes.	`true` \| `false`
`spec.priority`	Policy priority. If a workload matches multiple policies, the policy with the lower numeric value has higher priority.	Integer (priority values start at 0)

Define custom replica aggregation method values

You can define custom values for the CPU and memory replica aggregation method. This can be useful if the fixed values are overly aggressive.

To define custom aggregation method figures, you define values for custom1 and custom2 in Helm values.yaml, then define the value in the policy configuration as custom value 1 or custom value 2.

To define custom values:

Add the custom definition to Helm values.yaml:

rightsizing:
  config:
    customReplicaAggregationMethods:
      cpu:
        custom1: 70
        custom2: 71
      memory:
        custom1: 80
        custom2: 81

Note that both CPU and memory are in the same section.

In the policy on the cluster, change the value to the desired value.
For example, to set the memory value to 80, change the value in the policy spec.memory.replicaAggregationMethod to custom value 1.

Boost startup resources

Some workloads require higher CPU or memory during initialization than during normal operation. To accommodate this, you would need to manually assign extra resources for startup (to prevent OOM or throttling) and then to reduce those assignments for ongoing operation.

With Pod rightsizing and Startup boost, you can temporarily increase CPU and memory resources for new Pods during initialization. After a configured duration, the resources automatically revert to initial values and then Pod rightsizing is applied.

With Startup boost:

Startup-related spikes are excluded from the steady state recommendation logic.
Steady-state requests are aligned with actual runtime usage.

You apply Startup boost to a workload by adding startupBoost to a custom Pod rightsizing policy that manages the workload.

spec:
   startupBoost:
 	duration: 5m
 	cpu:
   	  multiplier: 1.5
 	memory:
   	  multiplier: 3

For more information about creating and editing custom policies, see Create and manage custom policies - CLI.

For the description of the parameters, see Policy parameters reference.

Startup boost limitations and considerations

Applies to all containers in a Pod.
Requires in-place Pod mutation support.
Cannot be used if preferInPlace: false
If a Pod needs to be evicted to a different node due to insufficient node capacity, it will be evicted even if the policy includes: spec.evictPodsInDeferredState = false

Selective Pod rightsizing

There may be cases where you want to manually manage workload memory resources while allowing Pod rightsizing to manage CPU resources, or vice versa.

For example, you may want separate control for memory-critical or CPU-critical applications (such as Java applications), or for testing purposes.

With Selective Pod rightsizing, you can apply Pod rightsizing to only one resource while manually managing the other resource.

To use Selective Pod rightsizing, create a custom policy and remove the entire configuration block for the resource that you want to manage manually.

For example, to apply Pod rightsizing only to CPU resources, remove the entire memory section from the policy:

memory:
  bufferPercentage: 40
  ignoreOomEvent: false
  minimumImpactPercentage: 10
  minimumStepForScaleUp: 256Mi
  minimumValueForScaleDown: 256Mi
  optimizationStrategy: max
  replicaAggregationMethod: max