Editorial photograph of an enterprise Google Cloud CUD review at the contracted Google Cloud cycle
Google Cloud · CUD · Article

Google Cloud Committed Use Discounts. Up to 70 percent off three year on resource based, and a commit shortfall trap most customers walk into.

Three CUD variants: resource based (deepest, rigid), flexible (best balance, up to 46 percent three year), and spend based (managed services). The p20 baseline sizing rule. BigQuery slot commitments at 40 percent below on demand. Eleven buyer moves.

Contact Us Google Cloud Practice
500+Vendor engagements
15 to 30%Google Cloud saving
Industry Recognized
500+ Enterprise Clients
$2B+ Under Advisory
11 Vendor Practices
100% Buyer Side Independent

Google Cloud Committed Use Discounts reward buyers who commit only to the consumption floor they can prove, and punish those who commit to forecasts that never arrive.

Key takeaways

  • Three variants: resource based, flexible, and spend based CUDs each trade discount depth for flexibility.
  • Resource based: the deepest discount, up to 70 percent on three year commitments, but rigid on machine family and region.
  • Flexible: applies across families and regions automatically, with a smaller but still material discount.
  • Baseline rule: commit to the p20 conservative floor, layer flexible on the variable band, burst on demand.
  • Shortfall trap: an over sized commitment bills whether or not you consume it.
  • Outcome: a layered commitment architecture moves 15 to 30 percent of compute spend in most estates.

How do Google Cloud Committed Use Discounts work?

A CUD trades a one or three year spending commitment for a discounted rate. The deeper the rigidity, the deeper the discount.

The three variants sit on a flexibility curve. Resource based is cheapest and most rigid, spend based is most flexible and shallowest.

What are the three CUD variants?

How deep do the discounts go?

Resource based three year commitments reach the deepest rates, with current bands published on the Compute Engine pricing page. Flexible sits below that, and spend based is shallower again.

What are the real discount levers in a CUD negotiation?

The published CUD rate is only half the picture. The platform wide discount and the commitment architecture move more money.

Three levers carry most of the value, and they stack.

  • Architecture: resource based for the floor, flexible for the variable band.
  • Stagger: spread commit end dates so they do not all renew together.
  • PPA stack: layer the platform discount on top of CUDs, not instead of them.

Google Cloud CUD variant comparison, illustrative

VariantDiscount depthFlexibilityBest fit
Resource basedDeepest, up to 70% three yearLow, fixed family and regionStable known workloads
FlexibleModerateHigh, any family or regionMixed Compute Engine estates
Spend basedShallowestService specificManaged services like BigQuery

Where the common advice on this topic is wrong

The standard account team pitch is to maximize the resource based commitment because it carries the deepest discount, so buyers commit to the full forecast. We disagree. In roughly 25 of the 35 Google Cloud estates we benchmarked in 2024 and 2025, the deeper rate was wiped out by 15 to 25 percent of unused commit that billed anyway when the forecast missed. The buyer side move is to commit resource based only to the p20 consumption floor you can prove from twelve months of telemetry, cover the variable band with flexible CUDs, and burst the rest on demand, because an unused deep discount costs more than a used shallow one.

An engineer reviewing cloud consumption dashboards on a wide monitor
Twelve months of consumption telemetry, classified by machine family and region, is the input that sets the safe commitment floor.
15 to 30%
Compute spend moved by layering
p20
Safe resource based commit floor
30 to 40%
BigQuery slot saving at steady state

Source: Redress Compliance advisory engagement file, 2024 to 2025.

An unused deep discount costs more than a used shallow one. Commit to the floor you can prove.

How does the p20 baseline rule work?

Take twelve months of hourly consumption, find the level you exceed 80 percent of the time, and commit resource based to that floor. That is the p20 baseline.

Everything above the floor goes to flexible CUDs or on demand. The floor is almost never wrong, so the deep discount is almost never wasted.

Why classify by machine family and region?

Resource based CUDs only apply to the exact family and region you commit. Classify consumption first, or the discount strands when workloads move.

What buyer side moves win a CUD negotiation?

Bring telemetry, not a forecast. The commitment that survives an audit is the one built from history.

  1. Classify twelve months of consumption by family, region, and service.
  2. Set the resource based commit to the p20 floor.
  3. Cover the variable band with flexible CUDs.

How do CUDs differ from sustained use discounts?

Sustained use discounts apply automatically to steady running instances with no commitment, while CUDs require a one or three year commit for a deeper rate. The two coexist on the same estate.

What telemetry do you bring to the negotiation?

Bring twelve months of hourly usage classified by machine family, region, and service. That history, not a sales forecast, sets a commitment that survives the term.

What to do next

  1. Export twelve months of hourly consumption and classify it by machine family, region, and service.
  2. Calculate the p20 floor and commit resource based only to that level.
  3. Cover the p20 to p70 variable band with flexible CUDs.
  4. Size BigQuery slot commitments to steady state, not peak.
  5. Stagger commitment end dates across quarters to avoid a single renewal cliff.
  6. Stack the platform wide discount on top of the CUD architecture rather than treating them as alternatives.

Download the GCP Negotiation Framework

Get the buyer side framework our advisors use on live Google Cloud engagements. The CUD math, the p20 baseline rule, and the commitment sequence.

Download the GCP Framework
Run the software spend health check against your Google Cloud estate in minutes.
Start the assessment

Frequently asked questions

What are the three Google Cloud CUD variants?

Resource based CUDs commit to specific vCPU and memory in a region for the deepest discount, flexible CUDs commit to an hourly spend across families and regions for a moderate discount, and spend based CUDs commit to a managed service spend for a shallower service specific discount.

How much can a Google Cloud CUD save?

Resource based three year commitments reach the deepest rates, up to roughly 70 percent on covered compute, while a layered architecture across resource based, flexible, and on demand typically moves 15 to 30 percent of total compute spend in practice.

Resource based or flexible CUD, which should I use?

Use resource based for the stable consumption floor where the machine family and region are known with high confidence, and flexible for the variable band because it applies automatically across families and regions even when workloads move.

What is the p20 baseline rule?

The p20 baseline is the consumption level you exceed about 80 percent of the time over twelve months. Committing resource based only to that floor means the deep discount is almost never wasted on capacity you do not use.

What is the commit shortfall trap?

The shortfall trap is committing to a forecast that never arrives. A resource based CUD bills the committed amount whether or not you consume it, so an over sized commitment quietly erases the discount it was meant to deliver.

Can CUDs stack with a platform discount?

Yes. The platform wide discount applies on top of the CUD architecture, so the two are complementary rather than alternatives, and the strongest position stacks both.

When should I negotiate CUDs?

Negotiate CUDs alongside the broader platform discount conversation, after you have twelve months of consumption telemetry classified by machine family, region, and service, so the commitment is built from history rather than a sales forecast.

How are BigQuery commitments priced?

BigQuery slot commitments are sized to steady state query demand and typically land 30 to 40 percent below on demand pricing when committed at the right level, which is why steady state sizing matters more than peak.