Kubernetes requests & limits: practical sizing

Scheduling is based on requests. That’s why most capacity planning starts with requests and then uses limits to reason about burst and risk.

1) Requests drive node count

A simple approach: total requests = pods × per-pod request. Then divide by allocatable per node. Our calculator does that and takes the larger of CPU-based and memory-based counts.

2) Leave allocatable headroom

Nodes aren’t 100% allocatable. System overhead, daemonsets, and kubelet reservations reduce usable capacity. Planning with 85–95% allocatable is common depending on your environment.

3) Limits matter for burst behavior

  • CPU limits can throttle bursts.
  • Memory limits can lead to OOM kills if pods exceed limits.

Tool

Kubernetes requests & limits calculator


Last updated: 2026-01-06