Compute Savings Plans
Lambda and Fargate spend is covered by Compute SPs. Most environments are under-covered on serverless spend specifically. See our SP optimization.
Serverless looks cheap at small scale. At enterprise volume, the picture is more complicated. Lambda, Fargate, Step Functions, and API Gateway each have their own pricing logic — and most are covered by Compute Savings Plans.
Serverless is not a single product. AWS markets four distinct services under the umbrella, each with a different billable unit and a different optimization lever. The right approach depends on where your spend concentrates.
| Service | Billable Units | Savings Plan Coverage |
|---|---|---|
| Lambda | Requests + GB-seconds (memory x duration) | Compute Savings Plans |
| Fargate (ECS/EKS) | vCPU-hours + GB-hours | Compute Savings Plans |
| Step Functions | State transitions (Standard) or duration (Express) | No SP coverage |
| API Gateway | Requests + data transfer | No SP coverage |
| EventBridge | Events processed | No SP coverage |
Lambda's free tier hides the real economics. At enterprise volume, Lambda becomes economically equivalent to Fargate at around 50% sustained CPU utilization, and to a Spot-backed EC2 fleet at around 20% sustained utilization. Above those thresholds, you are paying a serverless premium. Below them, Lambda is the cheaper option even at scale.
The most important Lambda lever is memory tuning. Memory determines both the per-execution cost (GB-seconds) and the CPU allocation. Many functions are configured at 128MB by default and run two to four times slower than they would at 512MB or 1024MB. The longer execution outweighs the lower memory rate, so the bill goes up while the function is undersized. Memory tuning typically reclaims 25-40% of Lambda spend in untouched environments. AWS Lambda Power Tuning makes this mechanical.
Fargate sits between Lambda and EC2 in pricing logic. It is per-second billed, vCPU and memory configurable, and covered by Compute Savings Plans. The Fargate-versus-EC2 crossover is roughly 60-70% sustained utilization for the Fargate task. Below that threshold, Fargate is cheaper. Above it, EC2 with Auto Scaling Group is cheaper. Fargate Spot extends that crossover meaningfully — interruption-tolerant workloads are almost always cheaper on Fargate Spot than on managed EC2.
Graviton-backed Lambda functions (arm64 architecture) carry a 20% discount on GB-seconds versus x86, and frequently outperform x86 on equivalent memory. The migration is a single configuration change for most Python and Node.js workloads. Fargate has the same Graviton option (FARGATE platform 1.4.0+ with arm64 task definition). Most serverless estates have not made the switch. The savings are 15-22% across the affected workloads, with no operational change required.
Lambda and Fargate spend is covered by Compute SPs. Most environments are under-covered on serverless spend specifically. See our SP optimization.
Lambda Power Tuning sweep across all production functions. Reclaim 25-40% of Lambda spend by sizing memory correctly.
Switch Lambda and Fargate workloads to arm64. 20% discount with no real migration effort for most managed runtimes.
Standard charges per state transition; Express charges per execution duration. Most workflows belong on one or the other, not both.
HTTP API is up to 71% cheaper than REST API for equivalent endpoints. Most accounts still default to REST API.
For interruption-tolerant containers, Fargate Spot delivers 70% off list. Most EKS clusters should run 50%+ on Fargate Spot.
500+ engagements. $340M+ in documented savings. We tune memory, switch architectures, cover with Savings Plans, and negotiate the EDP tier on top.