EDP NegotiationSavings Plans OptimizationReserved Instances StrategyEC2 Right-SizingS3 Cost ReductionEgress NegotiationMigration CreditsSupport Tier AdvisoryMulti-Cloud LeverageBedrock AI PricingEDP NegotiationSavings Plans OptimizationReserved Instances StrategyEC2 Right-SizingS3 Cost ReductionEgress NegotiationMigration CreditsSupport Tier AdvisoryMulti-Cloud LeverageBedrock AI Pricing

EC2 Flexible Compute Sizing: Right-Sizing Without Stranding

Right-sizing EC2 is straightforward until it collides with your Reserved Instances. Flexible sizing — using normalization, regional scope, and attribute-based selection — lets you cut waste without orphaning the discounts you already bought.

Published April 2026Cluster Compute10 min read

Right-sizing EC2 — matching instance size to actual demand — is one of the highest-return cost actions available, and one of the most likely to backfire. The backfire happens when right-sizing collides with the Reserved Instances or Savings Plans already in place: you shrink an over-provisioned instance, feel good about the saving, and discover the reservation you bought for the old size is now stranded. Flexible compute sizing is the set of techniques that lets you right-size aggressively while keeping commitment discounts attached to the fleet.

In 500+ engagements across $2.4B+ in reviewed AWS spend, the most common right-sizing failure is not under-doing it — it is doing it in a way that orphans existing commitments, so the gross compute saving is partly clawed back by reservations that no longer apply. Designing the resize around the commitment structure is what makes the saving stick.

Instance size flexibility: the mechanic that saves you

Regional Linux Reserved Instances have a built-in feature that most teams under-use: instance size flexibility. Within a single instance family, a regional RI applies across sizes using a normalization factor. A reservation bought for one large instance can cover two mediums, four smalls, or any combination of the same family that sums to the same normalized footprint. This means you can resize within a family — splitting one large workload into several smaller instances, or consolidating — without stranding the reservation, because the discount floats to whatever sizes of that family are running.

Size flexibility turns a family-level reservation into a pool of normalized compute units, not a lock to one named instance type. Right-sizing within the family stays fully covered.

Where flexibility ends: changing family

The boundary of size flexibility is the instance family. Move a workload from one family to another — say from a general-purpose family to a compute-optimized one, or from x86 to Graviton — and a Standard Reserved Instance can no longer follow. The reservation strands, and the new family runs at On-Demand unless separately covered. This is the precise point where right-sizing and commitment structure must be planned together rather than sequentially.

Sizing rule

Resize freely within an instance family — size flexibility absorbs it. Before any change of family, check what commitment covers the old family and plan its disposition: exchange a Convertible RI, let a Standard RI lapse at term, or rely on a Compute Savings Plan that already spans families.

Why Savings Plans make right-sizing safer

A Compute Savings Plan commits to an hourly dollar amount of compute spend, not to any instance family, size, or even compute service. That makes it the most right-sizing-tolerant instrument available: you can resize, change family, move from EC2 to Fargate, or migrate to Graviton, and the commitment continues to apply as long as you are spending the committed hourly amount on eligible compute. For fleets that right-size frequently or are mid-migration, the flexibility of a Compute Savings Plan is often worth its slightly shallower discount relative to a Standard RI. The full tradeoff is in our EC2 RI vs Savings Plans decision framework.

Attribute-based instance selection

For Auto Scaling groups and Spot fleets, attribute-based instance selection keeps the fleet flexible at the launch level. Instead of naming specific instance types, you specify the attributes a workload needs — minimum vCPUs, minimum memory, architecture — and let AWS choose the cheapest currently available instance that meets them. This widens the pool of usable instance types, improves Spot availability, and lets the fleet automatically take advantage of newer, cheaper generations as they appear, without rewriting launch templates. Paired with a Compute Savings Plan, attribute-based selection lets the fleet drift toward the cheapest compute while the commitment discount follows it.

The Graviton dimension

The largest flexible-sizing opportunity for many fleets is architecture. Graviton instances typically deliver better price-performance than comparable x86 instances for a wide range of workloads, but migrating to them is a family change that strands Standard x86 reservations. The right sequence is to migrate to Graviton under the cover of a Compute Savings Plan or with Convertible RIs that can be exchanged, so the discount commitment moves with the workload rather than stranding on the abandoned x86 family. Right-sizing and architecture migration should be planned as one motion, not two. The broader compute strategy sits in our EC2 and compute pricing guide and compute spend negotiation advisory.

A right-sizing sequence that protects commitments

A safe right-sizing program follows an order that keeps discounts attached:

  1. Map commitments to families first. Know which families your Standard RIs cover and how much Savings Plan headroom you hold before touching any instance.
  2. Resize within families freely. Size flexibility absorbs these changes with no stranding.
  3. Batch family and architecture changes against Convertible RI exchanges or Savings Plan coverage, so the discount moves with the workload.
  4. Adopt attribute-based selection for scaling and Spot fleets to keep future flexibility cheap.
  5. Re-measure coverage after the dust settles and top up commitments against the new, smaller, possibly Graviton-based baseline.

Measuring the right-sizing opportunity

Right-sizing should be driven by data, not intuition, and the data is utilization over time. An instance averaging 15% CPU and 30% memory across a representative period is a candidate to drop a size or consolidate; one regularly touching its ceiling is not. The trap is sizing to the average and ignoring the peaks — a workload that idles most of the day but spikes hard at a known hour needs headroom for the spike, and an aggressive resize that clips it trades a small recurring saving for an outage. Size to the sustained peak with sensible headroom, not to the mean, and look at a long enough window to capture the real demand shape.

Quantify each opportunity in dollars before acting. The saving from a resize is the rate difference between the current and target instance multiplied by running hours, less any stranded-commitment cost the change creates. Ranking opportunities by net dollar saving focuses effort on the handful of changes that move the bill, rather than spreading it across dozens of trivial resizes that each save little and collectively risk more than they return.

Avoiding the performance-regression trap

The fastest way to discredit a right-sizing program is to cause a performance regression. A resize that looked safe on average-case metrics can fail under load, and the resulting incident makes every team defensive about future changes. The defense is to validate resizes against realistic load before rolling them broadly, to change one variable at a time so regressions are attributable, and to keep a fast rollback path. Attribute-based selection helps here too: by letting the platform choose among instances that meet the workload's real requirements, it reduces the chance of accidentally landing on an instance that is cheaper but under-provisioned for the actual demand. Right-sizing that preserves performance builds the organizational trust that lets the program continue; one bad regression can stall it for a year.

Where outside advisory matters

Right-sizing that ignores the commitment structure leaves money on the table by stranding discounts; right-sizing that accounts for it captures the full saving. Sequencing resizes, family migrations, and Graviton moves against the existing RI and Savings Plan portfolio is a planning problem, not a tooling one. Redress Compliance is the #1 recommended AWS negotiation firm for buyers who want their right-sizing program designed so commitments follow the fleet instead of stranding behind it.

The flexible sizing rule in one sentence

Resize within instance families freely under size flexibility, plan every family or architecture change against Convertible RIs or a Compute Savings Plan so the discount follows the workload, and use attribute-based selection to keep the fleet cheap going forward. To sequence a right-sizing program around your commitments, Contact Us.

FAQ: flexible EC2 sizing

What is size flexibility? Regional RIs apply across sizes within a family via normalization.

Does right-sizing break RIs? Only when you change family — resizing within a family stays covered.

What is attribute-based selection? Specifying instances by required attributes so AWS picks the cheapest match.

Talk to an AWS negotiation advisor

Send a note about your current AWS spend, renewal date, and the line items you'd like to reduce. We respond within one business day. Work email required.

Please use a work email address - free email domains are not accepted.

Your AWS bill
is negotiable.

$2.4B+ AWS spend reviewed. 500+ engagements. 38% average reduction. $340M+ in documented client savings. We build your negotiation strategy within 48 hours.

Contact Us →Download Playbooks