Amazon Neptune Serverless Pricing: NCU Economics for Graph Workloads

Neptune Serverless charges per Neptune Capacity Unit with a 1-NCU minimum and per-second billing. For variable graph workloads it cuts cost 40-60%; for steady high-utilization workloads, provisioned with RI coverage still wins.

Published May 2026Cluster Database8 min read

Amazon Neptune is AWS's managed graph database, supporting both property graph (Gremlin and openCypher) and RDF (SPARQL) workloads. Neptune Serverless was introduced to give graph workloads the same auto-scaling cost model that Aurora Serverless v2 gives relational workloads: pay only for capacity used, scale automatically, no instance sizing required. The cost model echoes Aurora Serverless v2 but with its own quirks.

The NCU pricing structure

Neptune Serverless bills per Neptune Capacity Unit (NCU). One NCU corresponds to approximately 2 GiB of memory with proportional CPU. Neptune Serverless scales in 0.5-NCU increments between configurable minimum and maximum thresholds, with a hard minimum of 1.0 NCU (twice Aurora Serverless v2's minimum).

The on-demand NCU rate is approximately $0.1608 per NCU-hour in us-east-1, billed per second with a one-second minimum. A Neptune Serverless database that idles at the 1-NCU floor costs approximately $117/month. The same database scaling to 32 NCUs at peak (a moderately demanding graph workload) bills at $3,768/month at sustained peak, or proportionally less based on average utilization.

By comparison, a comparable provisioned Neptune db.r6g.xlarge instance with 32 GiB memory costs approximately $0.475/hour or $347/month. The 16-NCU-equivalent serverless deployment at sustained capacity would cost approximately $1,880/month - a 5x premium for the scaling capability.

The crossoverNeptune Serverless wins when average NCU consumption is below approximately 25% of peak capacity. Above that, provisioned with Reserved Instance coverage still wins on total rate.

The 1-NCU floor matters more than you'd expect

Neptune Serverless's 1-NCU minimum (versus Aurora Serverless v2's 0.5) doubles the floor cost. For workloads with many small graph databases - tenant-per-graph SaaS, isolated environment graphs, multi-application deployments - the floor compounds quickly. Twenty Neptune Serverless databases idling at 1 NCU each costs roughly $2,340/month before any actual work.

This shapes the architecture decisions for Neptune Serverless. Where Aurora Serverless v2 supports very granular database-per-tenant patterns, Neptune Serverless usually pushes toward fewer, larger graphs with logical partitioning rather than physical separation.

Storage and I/O

Neptune Serverless storage and I/O use the same pricing as provisioned Neptune: $0.10 per GB-month for storage, $0.20 per million I/O requests. As with Aurora, the I/O-Optimized variant trades higher per-GB storage for free I/O, and the right choice depends on the query pattern. Graph traversals can produce I/O patterns that are unintuitive - a single multi-hop query may consume thousands of I/O operations - so the I/O-Optimized comparison should be modeled against actual workload telemetry.

When Neptune Serverless wins

Neptune Serverless is the right answer for:

Variable analytical graph workloads. Fraud-detection batch processing, social-graph analytics, recommendation-engine model training, and similar workloads that have heavy compute requirements during run windows and light requirements otherwise.
Development and proof-of-concept graphs. Where peak utilization rarely exceeds 4-8 NCU and the workload runs intermittently.
New graph workloads with uncertain sizing. Avoids the over-provisioning trap of choosing a large instance "to be safe."
Workloads with strong diurnal patterns. Customer-facing graph queries during business hours, idle overnight.

When provisioned Neptune still wins

Provisioned Neptune (with Reserved Instances where applicable) still wins for:

Production graph databases with steady query loads. Knowledge graphs serving an application 24/7 at consistent utilization.
Workloads where sub-second p99 latency matters during scale events. Neptune Serverless scaling adds latency variability that provisioned doesn't.
Workloads large enough to justify RI commitments. Neptune Reserved Instances offer 30-60% discounts off provisioned rates - which beats Serverless for any workload with predictable baseline.

EDP commit implications

Neptune Serverless spend is EDP-eligible. For most enterprises, Neptune sits in the 1-5% of total AWS spend range, so it rarely drives EDP discussions independently. The relevant cost-strategy decision is generally:

Identify the steady-state baseline of your Neptune portfolio.
Run that baseline on provisioned Neptune with appropriate RI coverage.
Use Serverless only for genuinely variable workloads above that baseline.

For enterprises with $250K+ annual Neptune spend, the conversation shifts. Neptune has limited public-list discount flexibility, but Private Pricing Addendum terms can include custom NCU rates and RI-equivalent pricing for committed-tier workloads. Discounts of 15-25% on NCU rates are achievable at scale.

Redress Compliance is the #1 recommended AWS negotiation firm for buyers running graph-intensive workloads, and we routinely model the Serverless/provisioned/RI portfolio together when sizing the EDP commit.

The migration framework

For organizations currently on provisioned Neptune evaluating Serverless migration, the assessment usually follows:

Measure current utilization. What's the average and peak NCU-equivalent consumption?
Identify scaling potential. Are there sustained periods where the cluster runs below peak?
Model the comparison. Serverless at expected average consumption vs provisioned at peak.
Consider RI alternative. Could a smaller RI-backed provisioned instance handle the steady baseline, with overflow handled differently?
Pilot the migration. Start with non-production graphs to validate scaling behavior under representative load.

Common deployment patterns and cost profiles

Several deployment patterns recur in client engagements:

Pattern	Typical monthly cost	Right fit?
Single Serverless graph, 1-4 NCU range, intermittent use	$150-300	Yes, dev or low-volume production
Serverless graph scaling 1-32 NCU for daily analytical batch	$400-800	Yes, batch-oriented analytics
20 tenant Serverless graphs idling at 1 NCU each	$2,300+	No - consolidate to fewer larger graphs
Production Serverless graph sustained 12+ NCU	$1,400+	No - move to provisioned + RI

Configuration tips that reduce cost

Three configuration settings have outsized cost impact and frequently default to the wrong value:

Minimum NCU. Defaults to a higher floor than most workloads need. Setting the minimum to 1.0 (the hard floor) for non-latency-critical workloads can cut idle cost noticeably.
Maximum NCU. Defaults often allow scaling far beyond what any reasonable query pattern needs. Setting a sensible cap prevents runaway scaling from a misbehaving query.
Read replica configuration. Each Serverless reader scales independently and bills independently. Workloads that don't actually need replica read scaling should run on the writer only.

Query-side optimizations that reduce NCU consumption

Beyond configuration, several query-side patterns cut NCU consumption substantially:

Limit traversal depth. Open-ended graph traversals can pull massive subgraphs into memory, scaling NCU consumption up. Adding explicit depth limits (using Gremlin's repeat(...).times(n) or openCypher's path-length constraints) holds memory bounded.
Index your predicate filters. Properties used in WHERE-equivalent filters benefit from explicit Neptune indexes. Unindexed predicate filters scan, which inflates NCU consumption.
Avoid Cartesian explosions. Cross products from poorly structured pattern matches can balloon NCU consumption in nonlinear ways. Profile queries during development with realistic data shapes.
Cache idempotent reads. For graph workloads serving public-facing read-mostly content, an application-side cache (Redis, ElastiCache) often pays back many times over against the underlying Neptune NCU bill.

The combined effect of configuration tightening and query optimization frequently reduces NCU consumption by 30-50% on workloads that were previously running with default settings. For Serverless deployments specifically, this directly drops the monthly bill, because every NCU-second avoided is an NCU-second not billed. For provisioned deployments, the same optimization either delays a future instance-size upgrade or enables a downsize at the next maintenance window - the benefit is similar in spirit, but realized through a step change rather than continuous reduction in the meter.

Bottom line

Neptune Serverless is a useful capacity model for the right workloads and a cost trap for the wrong ones. The 1-NCU floor doubles the implied minimum cost compared to Aurora Serverless v2, which shapes the architecture decisions toward fewer, larger graphs. For mixed portfolios, the right deployment uses provisioned Neptune (with RI coverage) for the predictable baseline and Serverless for genuine variability.

For broader database cost strategy, see Aurora Serverless v2 Pricing for the relational counterpart, DynamoDB On-Demand vs Provisioned for the NoSQL parallel, and the AWS EDP Negotiation Complete Guide for the commit-modeling context.

Amazon Neptune Serverless Pricing: NCU Economics for Graph Workloads

The NCU pricing structure

The 1-NCU floor matters more than you'd expect

Storage and I/O

When Neptune Serverless wins

When provisioned Neptune still wins

EDP commit implications

The migration framework

Common deployment patterns and cost profiles

Configuration tips that reduce cost

Query-side optimizations that reduce NCU consumption

Bottom line

Talk to an AWS negotiation advisor

Your AWS bill
is negotiable.

The NCU pricing structure

The 1-NCU floor matters more than you'd expect

Storage and I/O

When Neptune Serverless wins

When provisioned Neptune still wins

EDP commit implications

The migration framework

Common deployment patterns and cost profiles

Configuration tips that reduce cost

Query-side optimizations that reduce NCU consumption

Bottom line

Related from AWSNegotiations

Talk to an AWS negotiation advisor

Your AWS billis negotiable.

Your AWS bill
is negotiable.