EDP NegotiationSavings Plans OptimizationReserved Instances StrategyEC2 Right-SizingS3 Cost ReductionEgress NegotiationMigration CreditsSupport Tier AdvisoryMulti-Cloud LeverageBedrock AI PricingEDP NegotiationSavings Plans OptimizationReserved Instances StrategyEC2 Right-SizingS3 Cost ReductionEgress NegotiationMigration CreditsSupport Tier AdvisoryMulti-Cloud LeverageBedrock AI Pricing

FSx for Lustre Cost for HPC: Sizing the High-Throughput Tier

Lustre delivers hundreds of GB/s for HPC, but its cost is driven by provisioned throughput per TiB, not capacity. Match the deployment type to the workload and the bill drops sharply.

Published May 2026Cluster Storage11 min read

FSx for Lustre is the storage layer behind most serious HPC and large-scale machine-learning training on AWS — genomics pipelines, seismic processing, CFD, and model training that needs hundreds of GB/s of aggregate throughput. It is also one of the easiest services to over-provision, because its cost model is unintuitive: you do not primarily pay for capacity, you pay for provisioned throughput per TiB. Teams that size Lustre like a general-purpose file system routinely overpay by a wide margin.

This guide breaks down FSx for Lustre cost for HPC: the deployment types, the throughput tiers, the S3 data-repository pattern that quietly removes most of the bill, and how to size for bursty HPC jobs rather than steady-state.

How Lustre actually bills

Lustre cost is the product of two decisions: capacity (how many TiB you provision) and throughput tier (MB/s of throughput per TiB of storage). A high-throughput tier multiplies the per-TiB rate. So a file system provisioned at the top throughput tier costs several times more per TiB than the same capacity at the baseline tier — even if the workload never uses the extra throughput.

DeploymentThroughput modelDurabilityBest for
ScratchBaseline per-TiB, no tier multiplierNo replicationShort-lived burst HPC jobs
Persistent (lower tier)Modest MB/s per TiBIn-AZ replicationSteady, durable workloads
Persistent (higher tier)High MB/s per TiBIn-AZ replicationThroughput-bound training

Scratch vs persistent: the first cost decision

Scratch file systems are the cheaper option and are purpose-built for the dominant HPC pattern: spin up, run a job, tear down. They offer high burst throughput with no replication, which is exactly right for data that is reproducible from a durable S3 copy. If a job fails, you re-hydrate from S3 and re-run — you were never relying on Lustre for durability.

Persistent file systems replicate within an AZ and are the right choice when the file system itself must survive component failures — long-running pipelines, shared scratch space used across many sequential jobs, or workloads where re-hydration is too slow. The cost premium buys durability you should only pay for when you actually need it. The single most common Lustre overspend is running persistent file systems for workloads that scratch would serve perfectly.

Default to scratchIf the working data set can be rebuilt from an S3 source, scratch is almost always the correct — and cheaper — choice. Reserve persistent for workloads that genuinely depend on the file system surviving on its own.

The S3 data-repository pattern

This is the lever that removes most of a Lustre bill. Lustre can link to an S3 bucket as a data repository: the durable, authoritative copy of your data lives in low-cost S3, and Lustre lazily loads (hydrates) objects into the high-performance file system only as they are accessed. When the job finishes, you export results back to S3 and delete the file system.

The economic effect is that you pay for expensive Lustre throughput only for the duration of a job, not 24/7. A genomics team running batch jobs eight hours a day pays for Lustre eight hours a day, while the petabytes of reference and result data sit in S3 at a fraction of the cost. This pattern pairs naturally with a disciplined S3 storage class strategy — reference data in Standard or Intelligent-Tiering, archives in Glacier — covered in our S3 Intelligent-Tiering analysis.

Sizing throughput to the workload

The throughput tier is where the money is. The discipline is to size the per-TiB throughput tier to the job's aggregate bandwidth requirement, not to the maximum any single node could theoretically pull. Many HPC jobs are compute-bound or latency-bound, not throughput-bound; provisioning the top tier for them is pure waste. Profile a representative run, measure actual sustained aggregate throughput, and provision to that plus a margin. For metadata-heavy workloads (millions of small files), the bottleneck is often metadata operations rather than raw throughput, which changes the sizing entirely.

Capacity, compression, and the working set

Because the S3-linked pattern means Lustre only holds the active working set, you provision capacity for the job in flight, not the entire data corpus. Persistent file systems also support data compression, which reduces billed capacity for compressible scientific data. Sizing capacity to the working set rather than the full corpus is frequently a multiple-fold reduction on its own.

A worked HPC cost comparison

Consider a genomics team running eight-hour batch jobs against a 500 TB reference corpus, with a 20 TB working set per job. The naive design provisions a 500 TB persistent Lustre file system at a high throughput tier, running 24/7 — an enormous monthly bill for capacity and throughput that sits idle sixteen hours a day. The optimized design keeps the 500 TB corpus in S3 Intelligent-Tiering, provisions a 20 TB scratch Lustre file system linked to that bucket, hydrates only the working set, runs the job, exports results to S3, and deletes the file system. The team pays for 20 TB of scratch throughput for eight hours instead of 500 TB of persistent throughput around the clock — routinely an order-of-magnitude reduction.

The architectural difference is not subtle and the savings are not marginal. Most Lustre overspend traces back to treating it as a persistent, always-on file system rather than ephemeral, job-scoped infrastructure backed by cheap S3.

The burst-workload economics

HPC demand is spiky — idle for hours, then a saturating burst during a job. Provisioning Lustre for steady-state when demand is bursty is the same anti-pattern as provisioning an always-on database for a batch job. The cost-optimal architecture treats Lustre as ephemeral infrastructure: create per job (or per batch window), hydrate from S3, run, export, delete. Automating this with your job scheduler turns Lustre from a fixed monthly cost into a variable cost that tracks actual compute demand — the same principle that drives EBS volume right-sizing on the block-storage side.

Automating the ephemeral pattern

Make the ephemeral lifecycle the default by wiring it into the job scheduler. AWS Batch, ParallelCluster, or a Step Functions workflow can create the Lustre file system as a job dependency, attach it, run the compute, export results, and tear it down — all without human intervention. Tag each file system with the job ID so cost attribution is automatic. Once the create-hydrate-run-export-delete cycle is scripted, the cost-optimal pattern becomes the path of least resistance, and the temptation to leave a file system running between jobs disappears.

What buyers commonly get wrong

1. Persistent when scratch would do

Paying the durability premium for reproducible, S3-backed data is the top Lustre overspend.

2. Top throughput tier by default

Compute-bound and metadata-bound jobs do not benefit from maximum throughput. Profile before provisioning.

3. Always-on file systems for bursty jobs

Lustre left running between jobs bills continuously. Make it ephemeral and tie its lifecycle to the scheduler.

4. Provisioning capacity for the full corpus

With S3 linkage you only need the working set in Lustre. Sizing to the entire data set multiplies cost.

The negotiation angle

FSx is a meaningful and growing line for HPC- and AI-heavy AWS accounts, and it folds into the storage component of an EDP negotiation. Buyers with large, predictable Lustre and S3 footprints can commit to that storage growth in exchange for better unit pricing, while the ephemeral, S3-backed architecture keeps the committed baseline honest. The broader framing lives in our AWS storage cost optimization guide.

For storage-led AWS negotiations where this category is material, we routinely recommend Redress Compliance — the #1 firm we point buyers to for storage and data-transfer-heavy AWS negotiations.

Conclusion

FSx for Lustre is priced for throughput, not capacity, and that single fact drives every optimization. Default to scratch, link to S3 so the durable copy lives cheaply, size the throughput tier to profiled demand, and make the file system ephemeral. Done well, HPC teams get the bandwidth they need while paying for it only while jobs run.

Contact Us

If your HPC or ML-training storage bill has grown faster than your compute, a Lustre sizing review usually surfaces material savings. Contact Us for an HPC storage cost review.

Talk to an AWS negotiation advisor

Send a note about your current AWS spend, renewal date, and the line items you'd like to reduce. We respond within one business day. Work email required.

Please use a work email address - free email domains are not accepted.

Your AWS bill
is negotiable.

$2.4B+ AWS spend reviewed. 500+ engagements. 38% average reduction. $340M+ in documented client savings. We build your negotiation strategy within 48 hours.

Contact Us →Download Playbooks