Cost

Convert idle RDS to Aurora Serverless v2

A dev database that sits idle nights and weekends still pays full price 24/7 on provisioned RDS — Aurora Serverless v2 scales capacity down when no one's using it, so you stop renting a box that's mostly asleep.

15 min·10 sections·AWS

Last reviewed 27 May 2026

Aurora Serverless v2: the basics

Why a provisioned database that's idle most of the time is the wrong shape

Provisioned RDS and provisioned Aurora bill you for a fixed instance — a db.r6g.large, say — for every hour it exists, whether it's serving ten thousand queries a second or sitting completely idle overnight. The instance is a box you rent by the hour, and its price has nothing to do with how hard you're working it. A dev database that's busy from 9 to 5 on weekdays and silent the other ~75% of the week still pays the full 168-hour weekly rate.

Aurora Serverless v2 changes the unit of billing from "instance-hours" to "capacity-hours." Capacity is measured in ACUs — Aurora Capacity Units, where 1 ACU is roughly 2 GB of RAM plus proportional CPU and networking, billed at about $0.12 per ACU-hour in US-East. You set a floor (MinCapacity) and a ceiling (MaxCapacity), and Aurora scales the running capacity up and down within that band in fine-grained steps, in near-real-time, in response to load. When the workload goes quiet, capacity drops toward the floor and the bill drops with it.

The flag fires when a database shows low average utilisation with idle stretches — typical of dev/test, internal tools, spiky or seasonal workloads, and the long tail of many small databases. Those are exactly the cases where paying a flat 24/7 instance rate is wasteful, and where letting capacity follow demand turns most of the idle hours into near-zero cost.

In this lesson you'll learn how to tell which databases are good Serverless v2 candidates and which should stay on provisioned-plus-Reserved-Instance, how ACU min/max sizing works, why v2 scales toward a configured floor rather than the hard zero the deprecated v1 offered, and the safe migration path of adding a Serverless v2 reader to the cluster and failing over. You'll see the CloudWatch queries that expose idle time and the exact CLI call that configures the scaling band — plus the edge cases that bite, like cold-ish scale-up latency and the cost crossover where steady high utilisation makes provisioned cheaper.

Fun fact

The weekend that never paid for itself

A team measured a single dev Aurora cluster on a db.r6g.large — about $0.29/hour, roughly $210 a month flat. CloudWatch showed it handled real queries for about 45 hours a week and sat at near-zero connections the other 123. They moved it to Serverless v2 with MinCapacity 0.5 and MaxCapacity 8. The busy hours cost a little more per hour at peak, but the idle 123 hours/week collapsed to the 0.5-ACU floor — about $0.06/hour. The cluster's monthly bill dropped to roughly $70. The punchline: across 40 similar dev and test databases, the same change saved more than the entire team's annual conference budget, and not one engineer noticed a difference in the working day.

Converting an idle database in action

Marcus runs the platform team at a logistics company. A finance review flags that their non-production Aurora fleet — 18 clusters on fixed db.r6g.large and db.r6g.xlarge instances — is costing about $4,800 a month, and most of them are dev and CI databases.

He pulls 14 days of CloudWatch on one of them. DatabaseConnections averages 3 during working hours and drops to zero from 7pm to 8am and all weekend. CPUUtilization sits around 6% with brief spikes to 30% during CI runs. This is the textbook idle-most-of-the-time profile: a fixed instance is paying full price for ~120 idle hours a week.

Rather than rebuild the cluster, Marcus adds a Serverless v2 reader to the existing Aurora cluster, lets it sync, then fails over so the serverless instance becomes the writer — a few seconds of connection blip, no data migration. He sets MinCapacity 0.5 and MaxCapacity 8 so CI bursts still get headroom. Projected drop on that one cluster: from ~$210 to ~$75 a month; across the candidate set, roughly $3k a month.

First, confirm the idle profile. Pull connection counts over two weeks — long stretches near zero are the strongest signal that a fixed instance is overpaying.

$ aws cloudwatch get-metric-statistics --namespace AWS/RDS --metric-name DatabaseConnections --dimensions Name=DBClusterIdentifier,Value=dev-orders-aurora --start-time $(date -u -d '14 days ago' +%FT%TZ) --end-time $(date -u +%FT%TZ) --period 3600 --statistics Average Maximum

{

"Datapoints": [

{ "Timestamp": "2026-05-12T10:00:00Z", "Average": 3.2, "Maximum": 11.0, "Unit": "Count" },

{ "Timestamp": "2026-05-12T22:00:00Z", "Average": 0.0, "Maximum": 0.0, "Unit": "Count" },

{ "Timestamp": "2026-05-13T03:00:00Z", "Average": 0.0, "Maximum": 0.0, "Unit": "Count" },

{ "Timestamp": "2026-05-13T10:00:00Z", "Average": 2.9, "Maximum": 9.0, "Unit": "Count" }

]

}

# Zero connections every night and weekend — a fixed instance pays full price for all of it.

14-day hourly connection counts: the idle windows that a 24/7 instance is paying for.

Now configure the Serverless v2 scaling band on the cluster. MinCapacity is the floor the cluster scales down to when idle; MaxCapacity is the spend ceiling and the headroom for bursts.

$ aws rds modify-db-cluster --db-cluster-identifier dev-orders-aurora --serverless-v2-scaling-configuration MinCapacity=0.5,MaxCapacity=8 --apply-immediately

{

"DBCluster": {

"DBClusterIdentifier": "dev-orders-aurora",

"EngineMode": "provisioned",

"ServerlessV2ScalingConfiguration": {

"MinCapacity": 0.5,

"MaxCapacity": 8.0

}

# EngineMode stays 'provisioned' — Serverless v2 instances live inside a normal Aurora cluster.

Setting the ACU band: 0.5 ACU floor (~$0.06/hr idle) up to 8 ACU for CI bursts.

Aurora Serverless v2 under the hooddeep dive

Serverless v2 is not a separate engine mode — it's an instance class (db.serverless) that lives inside an ordinary Aurora cluster alongside, or instead of, provisioned instances. That's why migration is non-destructive: you add a db.serverless reader to an existing provisioned cluster, let it catch up, then fail over to make it the writer. There's no dump-and-restore and no new endpoint; the cluster endpoint stays the same and clients reconnect through the same DNS after a few-second failover blip.

Capacity is measured in ACUs. One ACU is approximately 2 GB of memory with proportional vCPU and network, billed at roughly $0.12 per ACU-hour in US-East — so 4 ACUs (~8 GB) is about $0.48/hour, comparable to a provisioned db.r6g.large but only while you're actually running at 4 ACUs. Aurora adjusts capacity in fine increments within your MinCapacity/MaxCapacity band, scaling up in seconds under load and easing down as it subsides. Crucially, v2 scales toward your configured floor, not to a hard stop: classically MinCapacity was 0.5 ACU (so an idle cluster still costs ~$0.06/hour but stays warm with no cold-start), and newer configurations support scaling down to 0 ACU for auto-pause-like behaviour. This is the key difference from the deprecated Serverless v1, which paused to truly zero but suffered multi-second resume latency on the first connection.

The cost crossover is the thing to get right. Because per-ACU-hour pricing is set so that running at steady capacity costs a premium over an equivalent always-on provisioned instance, a database pinned near its peak 24/7 is cheaper on provisioned-plus-Reserved-Instance than on serverless. Serverless wins precisely when capacity spends most of its time well below peak — idle nights, quiet weekends, spiky bursts. The break-even rule of thumb: if average utilisation is below roughly 40-50% of peak with real idle windows, serverless usually wins; if it's steady and high, stay provisioned and commit.

# Non-destructive migration: add a Serverless v2 reader to an existing provisioned cluster.
aws rds create-db-instance \
  --db-instance-identifier dev-orders-serverless \
  --db-cluster-identifier dev-orders-aurora \
  --db-instance-class db.serverless \
  --engine aurora-postgresql

# Wait for it to become available and catch up on replication.
aws rds wait db-instance-available --db-instance-identifier dev-orders-serverless

# Fail over so the serverless instance becomes the writer, then remove the old provisioned one.
aws rds failover-db-cluster \
  --db-cluster-identifier dev-orders-aurora \
  --target-db-instance-identifier dev-orders-serverless

What is the impact of leaving idle databases on provisioned instances?

The direct cost is the most visible: a fixed instance bills the same whether it's saturated or idle. A db.r6g.large at ~$0.29/hour is ~$210/month, and a dev database that's genuinely used ~45 hours a week is paying for ~123 idle hours every week — about three-quarters of the bill bought nothing. Multiply across a non-production fleet of dozens of clusters and that's thousands of dollars a month of pure idle-time spend.

The second-order impact is sizing inertia. Once a fixed instance is chosen it tends to be sized for the worst case — the CI burst, the month-end batch, the demo — and then runs at that size permanently. Serverless removes the incentive to over-provision for peaks, because peaks only cost more during the peak. Teams that stay on fixed instances keep paying peak-sized bills for average-sized workloads, and the gap compounds as workloads drift quieter over time.

There's a commitment-economics trap in both directions. Reserved Instances and the RDS-equivalent reservations apply to provisioned instances, not to serverless ACU-hours, so committing reservations against an idle database locks in the wrong shape — you pay for a size you barely use for one to three years. But the inverse is also a trap: moving a genuinely steady, high-utilisation production database to serverless can raise the bill, because per-ACU-hour at sustained capacity exceeds a committed provisioned instance. Right-targeting is the whole game.

Finally, leaving the fleet on fixed instances makes spend less responsive to reality. When a project winds down or traffic seasonally drops, a serverless database's bill falls automatically; a provisioned one keeps charging the same until someone manually resizes or deletes it — and that someone usually never gets around to it. Usage-following cost turns "remember to scale this down" from a chore nobody does into the default behaviour.

How do you convert idle databases to Serverless v2 safely?

Conversion is a four-step loop that runs on the FinOps cadence: measure utilisation honestly, segment the fleet into serverless-wins vs commit-wins, migrate non-destructively, and set a scaling band that protects both performance and budget.

1. Measure utilisation before you decide anything

Pull at least 14 days of CloudWatch for each candidate: DatabaseConnections and CPUUtilization at minimum, plus FreeableMemory if the workload is memory-bound. You're looking for two things — average utilisation as a fraction of peak, and the presence of genuine idle windows (nights, weekends, between batch runs). Sustained zero-connection stretches are the clearest serverless signal; a flat, high, always-busy line is the clearest signal to leave it on provisioned.

2. Segment the fleet: serverless wins vs commit wins

Apply the crossover rule. Below roughly 40-50% average utilisation with real idle time, serverless almost always wins — most of the week collapses toward the floor. Steady, high, 24/7 utilisation is cheaper on provisioned plus a Reserved Instance, because per-ACU-hour at sustained capacity exceeds an equivalent committed instance. Never move a good reservation candidate to serverless, and never buy a reservation for a database you're about to make serverless.

3. Migrate non-destructively via reader-then-failover

Don't rebuild the cluster. Add a db.serverless instance as a reader to the existing Aurora cluster, let it catch up on replication, then fail over so it becomes the writer and remove the old provisioned instance. The cluster endpoint is unchanged, so clients just reconnect after a few-second blip — no data migration, no new connection string. For a database that isn't already Aurora, migrate it into Aurora first (via snapshot restore or DMS) before adding the serverless instance.

4. Set MinCapacity and MaxCapacity deliberately

MinCapacity is the warm floor: 0.5 ACU keeps an idle cluster responsive at ~$0.06/hour with no cold-start, while newer configs allow scaling to 0 ACU for auto-pause-like behaviour at the cost of a brief resume on the next connection — fine for dev, not for anything latency-sensitive. MaxCapacity is both the performance headroom for bursts and your spend ceiling, so size it to cover the real peak (the CI run, the batch job) and no more. Review the band quarterly as the workload drifts.

# Verify the workload is a serverless candidate (low average CPU with idle windows),
# then set a scaling band that covers bursts without leaving an oversized ceiling.
aws cloudwatch get-metric-statistics \
  --namespace AWS/RDS --metric-name CPUUtilization \
  --dimensions Name=DBClusterIdentifier,Value=dev-orders-aurora \
  --start-time $(date -u -d '14 days ago' +%FT%TZ) \
  --end-time $(date -u +%FT%TZ) \
  --period 3600 --statistics Average Maximum \
  --query 'sort_by(Datapoints,&Maximum)[-1]'

# Floor 0.5 ACU (warm, ~$0.06/hr idle), ceiling 8 ACU (CI burst headroom + spend cap).
aws rds modify-db-cluster \
  --db-cluster-identifier dev-orders-aurora \
  --serverless-v2-scaling-configuration MinCapacity=0.5,MaxCapacity=8 \
  --apply-immediately

Quick quiz

Question 1 of 5

A dev Aurora cluster on a db.r6g.large averages 6% CPU with zero connections every night and weekend, but spikes to 30% during CI runs. What's the right move?

Keep learning

Dig deeper into Aurora Serverless v2 mechanics, capacity sizing, and the cost trade-offs.

You've completed Convert idle RDS to Aurora Serverless v2. You now know how to read utilisation to spot idle-heavy candidates, where the serverless-vs-provisioned crossover sits, the non-destructive reader-then-failover migration path, and how to set a MinCapacity floor that stays warm and a MaxCapacity ceiling that caps spend. The next time a finance review flags a flat non-production database line, you'll have a four-step loop ready to run.

Back to the library

Aurora Serverless v2: what it means for the bill

Paying for a database by usage instead of by the hour it exists

Today most database line items on the cloud bill are fixed: the team picked an instance size once, and you pay that same amount every hour of every day regardless of whether anyone is using it. For a production database serving customers around the clock, that's fair — the capacity is genuinely needed. For a development, test, or internal database that's only busy during working hours, you're paying full price for a lot of idle time, often 60-75% of the week.

Aurora Serverless v2 switches the meter from "size of the box" to "amount of work done," measured in capacity units billed by the hour. When the database is busy it scales up and costs more; when it's quiet it scales down toward a configured floor and costs a fraction of that. For an intermittent workload the monthly bill tracks actual usage instead of sitting flat at the peak. The trade-off matters: for a database that really does run hard 24/7, paying per-unit-of-work continuously can cost more than a committed provisioned instance, so this is not a blanket "always cheaper" lever.

From a budgeting standpoint, the useful framing is that serverless converts a fixed cost into a variable one that follows the workload. That makes dev/test and bursty spend more honest — it falls when usage falls — but it also makes it less predictable month to month, so a sensible MaxCapacity ceiling acts as your spend guardrail. The number to ask for is the candidate list: which databases are sub-30% utilised with clear idle windows, and what's the projected monthly delta if they move to serverless.

This lesson is for the finance partner who sees a flat database line on the cloud invoice and wants to know whether it should be flat. It walks through why a fixed instance overcharges an intermittently-used database, how serverless converts that fixed cost into a usage-following variable cost, where the crossover point is so you don't accidentally make a steady workload more expensive, and what the MaxCapacity ceiling does as a budget guardrail. By the end you'll know which databases to put on the candidate list at the monthly review and what "good" looks like as a utilisation number and a trend.

Fun fact

The weekend that never paid for itself

How a finance partner frames the candidate list

Priya is the finance partner embedded with the platform team at a logistics company. At the monthly cost review the database line is flat at $4,800 across the non-production fleet, and she asks the question that's now standard on the agenda: "How many of these databases are busy less than a third of the time, and what would they cost if they billed by usage instead of by the hour?"

The answer isn't technical. Engineering pulls a one-page report: 14 of the 18 clusters run real queries fewer than 50 hours a week and sit idle the rest. The projected delta from moving those to a usage-based model is about $3,000 a month — a 60%-plus reduction on the part-time databases, with the four genuinely-busy production-adjacent clusters left on their committed instances because for those, usage-based would cost more, not less.

Priya adds two things to the finance pack. First, a recurring line showing non-production database utilisation, so the candidate list is visible every month rather than rediscovered annually. Second, a MaxCapacity sign-off note — because serverless turns a fixed cost into a variable one, she wants the ceiling documented so the bill can't surprise anyone. Three months later the line is down to $1,700 and stable, and she knows the right floor is "part-time databases bill like part-time, full-time ones stay committed."

Why this matters to the budget, not just the bill

The direct impact is material and concentrated: non-production database spend is often 10-25% of total database cost, and on idle-heavy fleets a large fraction of that is paying for hours when nothing is running. Moving the right candidates to usage-based pricing typically cuts those specific line items by half or more, which is enough to show up in a quarterly database budget rather than getting lost in variance.

The bigger budgeting impact is converting a fixed cost into a variable one that tracks the workload. That's a double-edged change finance should understand on purpose. The upside: dev/test and seasonal spend stop being flat at peak and fall when usage falls, which makes forecasts more honest. The cost: month-to-month spend becomes less predictable, so the MaxCapacity ceiling on each cluster is effectively the budget guardrail — it caps the worst case. Documenting those ceilings is the finance control that makes the variable model safe.

There's a commitment-economics angle that's easy to get wrong. Reservations apply to provisioned instances, not serverless capacity, so two rules follow: don't buy reservations for databases you're about to move to serverless, and don't move steadily-busy databases that are good reservation candidates onto serverless, because per-unit pricing at sustained load can exceed a committed instance. The right sequence is to segment the fleet first — part-time to serverless, full-time to committed provisioned — then commit. Getting the order wrong strands reservations or inflates the variable bill.

Finally, treat fleet utilisation as a leading indicator. If average non-production database utilisation is drifting down while the bill stays flat, that gap is exactly the money serverless recovers, and a flat bill against falling usage is the signal to re-run the candidate list. Watch the utilisation trend, not just the dollar total.

What finance can actually do about this

Finance can't reconfigure databases, but it can set the conditions that get the right ones onto the right pricing model and keep the variable bill safe. Three levers, used together at the monthly cadence.

1. Put non-production database utilisation on the monthly report

Add a standing line showing the non-production database fleet: dollar amount, count, and average utilisation. The candidate list — databases under ~40% utilised with idle windows — is the leading indicator. If the bill is flat while utilisation falls, that gap is recoverable spend and the prompt to re-run the segmentation.

2. Sequence reservations after segmentation, never before

Make it a rule that database reservation purchases happen only after the fleet has been split into serverless-wins and commit-wins. Buying a reservation for a database that's about to go serverless strands the commitment; the order matters more than the size. Finance owns the timing of commitments, so this is squarely a finance control.

3. Require a documented MaxCapacity ceiling on every serverless database

Serverless turns a fixed cost into a variable one, and MaxCapacity is the guardrail that caps the worst case. Make a signed-off ceiling a precondition for converting a database, so the variable bill can never surprise the budget. This is the single control that makes usage-based pricing safe to approve.

4. Track the utilisation trend, not just the dollar

It's normal and correct for some databases to stay on committed provisioned instances — that's the right model for steady workloads. The question isn't "why isn't everything serverless?" but "is every database on the model that fits its usage?" A fleet that's deliberately split, with utilisation watched monthly, is what good looks like; a fleet that's all-fixed regardless of usage is the problem.

Quick quiz

Question 1 of 5

The non-production database line is flat at $4,800/month, but engineering reports average fleet utilisation has fallen below 30% over the last quarter. As the finance partner, what's the right next move?

Keep learning

Dig deeper into Aurora Serverless v2 mechanics, capacity sizing, and the cost trade-offs.

You've finished the finance partner's view of converting idle databases to Serverless v2. You know why a fixed instance overcharges a part-time database, how usage-based pricing converts that into a variable cost with MaxCapacity as the guardrail, why reservation timing must follow fleet segmentation, and that the metric to watch is utilisation tracking the bill. Next time the database line shows up at the monthly review, you'll have a sharper question than "can we make this cheaper?"

Back to the library

Aurora Serverless v2: the headline

Stop renting databases full-time that are only used part-time

A large share of database spend in most organisations sits on dev, test, and internal databases that are only busy during working hours but are billed at full price all week. Aurora Serverless v2 lets capacity rise and fall with actual demand, so those part-time databases cost roughly what they're used for instead of a flat 24/7 rate. The savings on an idle-heavy fleet are typically meaningful, not marginal.

This is a targeting decision, not a mandate. Serverless wins on intermittent and unpredictable workloads; for databases that genuinely run hard around the clock, a committed provisioned instance is usually cheaper. The leadership question isn't "move everything to serverless" — it's "do we know which of our databases are mostly idle, and are we still paying full freight for them?"

A short read for the exec who wants the headline and the one question. You'll get the rule of thumb — usage-based for part-time databases, committed-provisioned for full-time ones — what this category signals about how deliberately the org sizes infrastructure, and what "good" looks like at the portfolio level. No commands, no internals.

Fun fact

The weekend that never paid for itself

What it looks like when the org gets this right

At one company the quarterly review used to show a flat non-production database line that never moved — the same number every quarter, justified as "that's just what the dev environments cost." The exec sponsor stopped accepting the flatness and asked one question: "How much of that is for databases nobody is using two-thirds of the week?"

The answer was: most of it. Within a quarter the part-time databases were moved to a usage-based model, the line dropped by more than half, and the genuinely-busy databases stayed on committed instances where they were already cheapest. Nobody had asked engineering to chase a dollar figure — the exec had asked them to stop paying full-time for part-time infrastructure, and the savings followed automatically.

That's the right outcome state. The goal was never "serverless everywhere"; it was "every database is on the pricing model that fits how it's actually used." Once that's true, the database line stops being a recurring question and becomes a confidence signal that infrastructure is sized deliberately.

Why this is on the report at all

Database spend is one of the larger and stickier categories on a cloud bill, and a meaningful slice of it sits on dev, test, and internal databases that are only used part of the week but billed full-time. This category is tracked because its size and trend say how deliberately the organisation matches pricing models to actual usage — a fleet that's all fixed instances regardless of utilisation usually means infrastructure is being sized once and forgotten, a pattern that recurs across far bigger spend categories.

The discipline here is targeting, not a blanket migration. Usage-based pricing wins on part-time and bursty databases; committed provisioned wins on genuinely steady ones. An org that gets this right has each database on the model that fits it, and the database line responds to real usage instead of sitting flat forever. That responsiveness — costs falling when a project winds down without anyone having to remember to act — is the signal worth caring about.

The leadership move on this category

The handle for an executive isn't to mandate serverless — it's to insist that pricing model follows usage, and to make that a one-line confidence check.

1. Insist that pricing model matches usage pattern

Part-time and bursty databases should bill by usage; genuinely full-time ones should be committed. The policy isn't "go serverless" — it's "no database stays on a model that doesn't fit how it's actually used." That single principle captures the savings without the over-correction of migrating everything.

2. Sequence commitments after the workload is understood

Require that reservation and commitment decisions come after the fleet has been segmented, not before. Committing to the wrong shape is one of the most common and least visible ways database spend gets locked in for years.

3. Make it a confidence signal at the leadership review

Ask for the trend, not the dollar: "Is non-production database cost tracking usage, or flat regardless?" A bill that responds to real usage is the sign infrastructure is sized deliberately. If that's true for three quarters running, the underlying discipline is healthy and attention belongs elsewhere.

Quick quiz

Question 1 of 5

You're reviewing the cloud cost pack. The database fleet has been deliberately split — part-time databases on usage-based pricing, steady ones on committed instances — and the non-production line now falls when projects wind down. What's the right read?

Keep learning

Dig deeper into Aurora Serverless v2 mechanics, capacity sizing, and the cost trade-offs.

That's the lesson. Two takeaways worth holding onto: part-time databases shouldn't be billed full-time, and the goal is matching pricing model to usage — not serverless everywhere. The leadership question is whether database cost tracks real usage or sits flat regardless.

Back to the library

Part of the learning path Right-size your compute

Convert idle RDS to Aurora Serverless v2

Aurora Serverless v2: the basics

The weekend that never paid for itself

Converting an idle database in action

Aurora Serverless v2 under the hooddeep dive

What is the impact of leaving idle databases on provisioned instances?

How do you convert idle databases to Serverless v2 safely?

1. Measure utilisation before you decide anything

2. Segment the fleet: serverless wins vs commit wins

3. Migrate non-destructively via reader-then-failover

4. Set MinCapacity and MaxCapacity deliberately

Quick quiz

Keep learning

Aurora Serverless v2: what it means for the bill

The weekend that never paid for itself

How a finance partner frames the candidate list

Why this matters to the budget, not just the bill

What finance can actually do about this

1. Put non-production database utilisation on the monthly report

2. Sequence reservations after segmentation, never before

3. Require a documented MaxCapacity ceiling on every serverless database

4. Track the utilisation trend, not just the dollar

Quick quiz

Keep learning

Aurora Serverless v2: the headline

The weekend that never paid for itself

What it looks like when the org gets this right

Why this is on the report at all

The leadership move on this category

1. Insist that pricing model matches usage pattern

2. Sequence commitments after the workload is understood

3. Make it a confidence signal at the leadership review

Quick quiz

Keep learning

Related cost lessons