Load balancer LCU/NLCU explained (for cost estimates)

Reviewed by CloudCostKit Editorial Team. Last updated: 2026-01-27. Editorial policy and methodology.

Start with a calculator if you need a first-pass estimate, then use this guide to validate the assumptions and catch the billing traps.


This page is the support explainer for the load balancer cluster, not the bill-boundary page and not the measurement workflow page: it exists to make LCU and NLCU unit logic easier to reason about once you know which page job you actually need. Many load balancers charge (1) a fixed hourly fee plus (2) a usage fee billed in capacity unit-hours.

If you need to decide what belongs inside the load balancer bill, go back to the pricing guide first; if you need to turn metrics into a defendable units-per-hour model, use the estimate guide. Open the pricing guide or open the estimate guide.

What “capacity unit-hours” means

Think of LCU/NLCU as a normalized “how busy was this load balancer this hour?” score. The unit is typically derived from multiple dimensions, and the billed unit-hours often follow the maximum of those dimensions.

  • Connections: new connections and/or active connections
  • Bytes processed: traffic volume through the LB
  • Request processing: rules/routing work (depends on product and configuration)

Why the same RPS can produce very different unit-hours

  • Payload size: 1kB responses vs 1MB downloads are not comparable.
  • Connection churn: short timeouts and frequent reconnects inflate new connections.
  • Long-lived connections: streaming/WebSockets increase active connections.
  • Incidents: retries can multiply requests and connections without increasing “real” business volume.

A practical mental model for optimization

  • If you have many LBs, LB-hours dominate: reduce load balancer count.
  • If you have a few hot LBs, unit-hours dominate: reduce bytes processed and connection churn.
  • If you have spikes, the “peak scenario” dominates: fix retries and bot traffic.

Optimization playbook: load balancer cost optimization

How to estimate units/hour (without getting lost)

  1. Pick a representative week and a peak definition (p95 hour or incident hour).
  2. Collect driver metrics: new connections/sec, active connections, bytes processed GB/hour.
  3. Use a calculator to convert driver metrics to units/hour.
  4. Price units/hour + fixed LB hourly fee, then validate after a week.

Common pitfalls

  • Budgeting from peak unit-hours for the whole month (hides the real average).
  • Ignoring payload size and estimating from requests only.
  • Missing retry storms and noisy clients as the “hidden multiplier”.
  • Mixing units (GB vs GiB, bits vs bytes) and silently breaking the estimate.
  • Not re-checking after architecture changes (CDN, compression, routing).

Quick diagnostic: which driver is dominating?

When you look at a week of metrics, one driver is usually “the max” most of the time. You can often predict the culprit from traffic shape:

  • Bytes dominate: large responses, downloads, missing compression, no CDN offload.
  • New connections dominate: short timeouts, clients reconnecting, lack of keep-alive.
  • Active connections dominate: streaming, long polling, WebSockets.
  • Rules dominate: complex routing rules that evaluate frequently.

Once you know the dominant driver, optimization becomes targeted instead of guesswork.

Sources


Related guides

Estimate ALB LCU (and NLB NLCU) from metrics: quick methods
A practical guide to estimate ALB LCU and NLB NLCU from load balancer metrics: new connections, active connections, bytes processed, and rule evaluations — with a repeatable workflow and validation steps.
ALB vs NLB cost: how to choose and estimate (LCU vs NLCU)
Compare ALB vs NLB cost by breaking the bill into LB-hours, ALB LCU drivers, NLB connection and byte drivers, and the traffic shape that decides which load balancer is actually cheaper.
ECS cost model beyond compute: the checklist that prevents surprise bills
A practical ECS cost model checklist beyond compute: load balancers, logs/metrics, NAT/egress, cross-AZ transfer, storage, and image registry behavior. Use it to avoid underestimating total ECS cost.
EKS pricing: what to include in a realistic cost estimate
Estimate EKS pricing by separating nodes, control plane, load balancers, storage, logs and metrics, and data transfer so the Kubernetes bill is broken into real cost surfaces before optimization starts.
Estimate WAF request volume (CDN/LB to monthly requests)
Estimate AWS WAF evaluated requests from CDN or load balancer metrics, log samples, attack windows, and bot spikes so monthly request models reflect baseline traffic and incident-heavy months.
Load balancer cost optimization (high-leverage fixes)
A practical playbook to reduce load balancer costs: cut LB-hours, reduce LCU/NLCU drivers (connections/bytes/requests), and prevent incident traffic amplification with a measurable validation plan.

Related calculators


FAQ

What is an LCU/NLCU?
A capacity unit is a provider-defined measure of load balancer usage. Billing is commonly per unit-hour and is driven by dimensions like connections and bytes processed.
Why is capacity unit billing confusing?
Because it’s not just request count. Two systems with the same RPS can have very different unit-hours based on payload size, connection churn, long-lived connections, and rule/routing behavior.
How do I estimate unit-hours without perfect observability?
Start from a few measurable drivers (connections/sec, active connections, GB/hour). Build an average + peak scenario, then validate and replace assumptions with metrics after a week.

Last updated: 2026-01-27. Reviewed against CloudCostKit methodology and current provider documentation. See the Editorial Policy .