In-Place Pod Resize is a Kubernetes feature that allows a running container’s CPU and memory requests and limits to be changed without restarting the Pod. This enables real-time vertical scaling for running workloads, removing the traditional requirement to evict or relaunch Pods whenever their resource specifications change.

Launch Context

In-Place Pod Resize entered alpha in Kubernetes v1.27 and graduated to beta (enabled by default) in v1.33, addressing a long-standing challenge in Kubernetes vertical scaling. Prior to this, adjusting a Pod’s resource requests required a full restart, an approach that introduced downtime and complexity for many production workloads.

How It Works

With In-Place Pod Resize, updates to CPU and memory requests (and limits) are applied live by patching the Pod’s resize subresource rather than recreating the Pod. The kubelet then attempts to modify the container’s resources in place, allowing the Pod to continue running without disruption.
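
A minimal sketch of such a patch, written in Go with client-go, is shown below; the namespace, Pod name ("my-api-pod"), container name ("app"), and resource values are illustrative, and it assumes a v1.33+ cluster reachable through the local kubeconfig.

```go
// Minimal sketch: raise a Pod's CPU and memory requests in place by patching
// the "resize" subresource (Kubernetes v1.33+). The namespace, Pod name,
// container name, and resource values are illustrative placeholders.
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Build a client from the local kubeconfig (assumes running outside the cluster).
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)

	// Strategic merge patch raising the "app" container's requests.
	patch := []byte(`{"spec":{"containers":[{"name":"app","resources":{"requests":{"cpu":"750m","memory":"512Mi"}}}]}}`)

	// The trailing argument targets the "resize" subresource instead of the main Pod resource.
	_, err = client.CoreV1().Pods("default").Patch(
		context.TODO(), "my-api-pod",
		types.StrategicMergePatchType, patch,
		metav1.PatchOptions{}, "resize",
	)
	if err != nil {
		panic(err)
	}
	fmt.Println("in-place resize requested")
}
```

kubectl exposes the same operation through its --subresource flag, so the patch can also be issued from the command line.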

If a resized Pod no longer fits on its current node, the kubelet marks the pending resize as “infeasible” and leaves the Pod running with its original resources rather than applying the change. Advanced implementations, like Zesty’s Pod Rightsizing, address this by:

  1. Detecting infeasible Pods,
  2. Gradually evicting them in a controlled rollout to minimize disruption,
  3. Recreating them on nodes with sufficient capacity,
  4. Applying updated resource requests via a mutation webhook.

This approach ensures minimal service interruption while maintaining optimal resource allocation.
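
As a rough illustration of steps 1-3 (and not Zesty’s actual implementation), the Go sketch below lists Pods whose pending resize was reported as infeasible and evicts them through the Eviction API so the scheduler can place them on nodes with sufficient capacity; the namespace is a placeholder, and the condition and reason names assume Kubernetes v1.33.

```go
// Generic sketch (not Zesty's implementation): find Pods whose in-place
// resize was reported as infeasible and evict them through the Eviction API
// so the scheduler can place them on a node with enough capacity.
// Condition and reason names follow Kubernetes v1.33; the namespace is a placeholder.
package main

import (
	"context"
	"fmt"

	policyv1 "k8s.io/api/policy/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)

	pods, err := client.CoreV1().Pods("default").List(context.TODO(), metav1.ListOptions{})
	if err != nil {
		panic(err)
	}

	for _, pod := range pods.Items {
		for _, cond := range pod.Status.Conditions {
			// A pending resize that cannot fit on the current node is surfaced
			// as condition "PodResizePending" with reason "Infeasible".
			if string(cond.Type) != "PodResizePending" || cond.Reason != "Infeasible" {
				continue
			}
			fmt.Printf("evicting %s: resize is infeasible on its current node\n", pod.Name)
			// The Eviction API respects PodDisruptionBudgets, which helps keep
			// the rollout controlled.
			evErr := client.PolicyV1().Evictions(pod.Namespace).Evict(context.TODO(), &policyv1.Eviction{
				ObjectMeta: metav1.ObjectMeta{Name: pod.Name, Namespace: pod.Namespace},
			})
			if evErr != nil {
				fmt.Printf("eviction of %s failed: %v\n", pod.Name, evErr)
			}
		}
	}
}
```

In practice, a controller would pace these evictions over time and rely on PodDisruptionBudgets (which the Eviction API honors) to keep the rollout gradual.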

Value Proposition

In-Place Pod Resize brings significant operational and financial benefits to Kubernetes environments:

  1. Zero Downtime Scaling: Eliminate disruptions tied to Pod restarts, which is critical for stateful applications and SLAs.
  2. Smarter Resource Allocation: Fine-tune CPU and memory resources in real time based on current needs.
  3. Improved Cost Efficiency: Avoid overprovisioning by dynamically adjusting resources instead of reserving excess capacity.
  4. Compatibility with HPA: Because vertical adjustments no longer force Pod restarts, In-Place Pod Resize removes a key blocker to running the Vertical Pod Autoscaler (VPA) alongside the Horizontal Pod Autoscaler (HPA).

Benefits

  1. No Service Disruption: Maintain availability during scaling operations.
  2. Operational Simplicity: Reduce the need for manual resource tuning and restart logic.
  3. More Precise Scaling: Enable per-container tuning, especially useful in multi-container Pods.
  4. FinOps-Aligned Optimization: Move from static provisioning to demand-based scaling.

Use Cases

  1. Production APIs: Adjust memory or CPU on live services under varying load.
  2. Stateful Workloads: Scale vertically without risking application state loss.
  3. VPA + HPA Environments: Combine vertical and horizontal scaling strategies effectively.
  4. Bursting Workloads: React to short-term spikes in usage without service disruption.

Challenges

  1. Node Fit Limitations: Enlarged Pods may not fit on their current nodes, requiring rescheduling.
  2. Operational Complexity: Coordinating evictions, rollout strategy, and webhook mutation may require additional tooling.
  3. Visibility & Controls: Teams need observability into scaling behavior and mechanisms for safe policy enforcement.

Integration with Zesty

Zesty’s Pod Rightsizing solution now supports In-Place Pod Resize, further enhancing its real-time optimization capabilities. When additional CPU or memory is needed:

  1. Zesty patches the resource requests without restarting the Pod.
  2. If the Pod becomes infeasible on its current node, Zesty gradually evicts and reschedules it in a controlled manner.
  3. The rescheduled Pod passes through a mutation webhook, ensuring updated resource requests are applied seamlessly.

This integration enables fully automated, zero-downtime vertical scaling as part of Zesty’s broader Kubernetes optimization toolkit.
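
For illustration only, here is a bare-bones sketch of the kind of mutating admission webhook described in step 3. It is not Zesty’s code: the endpoint path, container index, and hard-coded resource values are hypothetical stand-ins for whatever recommendation source feeds the webhook.

```go
// Generic sketch of a mutating admission webhook handler (not Zesty's
// implementation): when a Pod is (re)created, it returns a JSON patch that
// sets the first container's resource requests to recommended values.
// The endpoint path, container index, and values are placeholders.
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"

	admissionv1 "k8s.io/api/admission/v1"
	corev1 "k8s.io/api/core/v1"
)

func mutatePods(w http.ResponseWriter, r *http.Request) {
	body, err := io.ReadAll(r.Body)
	if err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}

	var review admissionv1.AdmissionReview
	if err := json.Unmarshal(body, &review); err != nil || review.Request == nil {
		http.Error(w, "malformed AdmissionReview", http.StatusBadRequest)
		return
	}

	var pod corev1.Pod
	if err := json.Unmarshal(review.Request.Object.Raw, &pod); err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}

	resp := &admissionv1.AdmissionResponse{UID: review.Request.UID, Allowed: true}

	// Only mutate Pods that actually have containers. A real controller would
	// look up recommended values per workload; they are hard-coded here.
	if len(pod.Spec.Containers) > 0 {
		patch := []map[string]interface{}{{
			"op":    "add", // "add" creates or replaces the requests object
			"path":  "/spec/containers/0/resources/requests",
			"value": map[string]string{"cpu": "750m", "memory": "512Mi"},
		}}
		patchBytes, _ := json.Marshal(patch)
		patchType := admissionv1.PatchTypeJSONPatch
		resp.Patch = patchBytes
		resp.PatchType = &patchType
	}

	review.Response = resp
	out, _ := json.Marshal(review)
	w.Header().Set("Content-Type", "application/json")
	w.Write(out)
}

func main() {
	http.HandleFunc("/mutate", mutatePods)
	// Admission webhooks must be served over TLS; the certificate paths are placeholders.
	fmt.Println(http.ListenAndServeTLS(":8443", "tls.crt", "tls.key", nil))
}
```

A production webhook would be registered through a MutatingWebhookConfiguration and would derive its values from observed usage rather than constants.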

