Three CUD variants: resource based (deepest, rigid), flexible (best balance, up to 46 percent three year), and spend based (managed services). The p20 baseline sizing rule. BigQuery slot commitments at 40 percent below on demand. Eleven buyer moves.
Google Cloud Committed Use Discounts (CUDs) are the published commitment vehicle for GCP compute and selected services. They come in three variants, each with its own discount ceiling and flexibility profile.
The three variants stack with sustained use discounts (automatic up to 30 percent on long running Compute Engine instances) and with any negotiated PPA discount on top. This paper sets out actual CUD discount rates, the three variant decision logic, the under utilization trap, the commit shortfall risk, and the eleven move buyer side playbook. Read the related Google Cloud services practice, the Google Cloud PPA negotiation, and the GCP negotiation leverage framework.
| CUD type | Commit | Discount (1yr / 3yr) | Flexibility |
|---|---|---|---|
| Resource based | Specific vCPU + memory in specific region | Up to 37% / 57% | Low |
| Flexible | Dollar hourly Compute Engine spend across families | Up to 28% / 46% | High |
| Spend based | Dollar hourly spend on specific service (Cloud SQL, Cloud Run, etc.) | Up to 20% / 28% | Service specific |
Resource based delivers the deepest discount but the lowest flexibility. Flexible delivers the best balance for most Compute Engine estates with mixed machine families. Spend based is service specific and only worth running on managed services where there is no resource based option.
Resource based CUDs apply to a specific vCPU count and memory amount in a specific region, for a specific machine family (n2, n2d, c3, e2, etc.). A three year resource based CUD on n2 standard delivers approximately 57 percent off on demand pricing. The commitment is rigid: the customer pays for the committed capacity regardless of whether it is consumed, and changes to machine family or region require burning the original CUD and starting over. Resource based works for predictable steady state workloads where the customer has high confidence in the instance mix for three years. Production application servers, predictable database tiers, and stable analytics infrastructure are typical candidates.
Spend based CUDs commit to an hourly dollar amount of consumption on a specific service. They cover Cloud SQL, Cloud Run, Cloud Spanner, Bigtable, App Engine, Cloud Functions, and BigQuery slot commitments. The discount ceiling is lower than resource based (typically 20 to 28 percent for three year commits) but the commit is more forgiving because it applies across instance configurations within the service. For BigQuery customers, the slot commitment math is particularly important: slots can be purchased as flex (no commit), one month, one year, or three year, with progressively deeper discounts. Three year BigQuery slot commitments at scale frequently land at 40 percent below on demand BigQuery pricing.
Flexible CUDs (introduced by Google as the modern replacement for the older spend based Compute Engine CUD) commit to a dollar hourly Compute Engine spend that applies automatically across machine families and regions. The three year discount is approximately 46 percent off on demand. Flexible CUDs sit between resource based (rigid, deeper discount) and spend based (very flexible, lower discount), and for most enterprise customers running a mix of Compute Engine machine families this is the right default. The buyer side rule is: use resource based CUDs to cover the predictable steady state baseline at the deepest discount, layer flexible CUDs against the variable but committed compute spend, and leave on demand for the burst layer.
CUD commitments are billed regardless of actual usage. If the customer commits to a three year resource based CUD covering 100 n2 standard vCPUs and only uses 70, they pay for the full 100.
The buyer side rule is to size the resource based commit at the conservative baseline (typically the p20 or p30 of trailing twelve months consumption), with the flexible CUD layer covering the next ten to twenty percent of utilization, and on demand handling the variable layer above that.
The eleven move framework, the resource based CUD framework, the spend based CUD framework, the flexible CUD framework, and the buyer side moves at every step of the contracted Google Cloud cycle.
Used across more than five hundred enterprise software engagements. Independent. Buyer side.
Google proposed three year resource based CUDs across our full Compute Engine estate at the optimistic forecast. Redress sized resource based at the p20 baseline, layered flexible CUDs against the variable band, and split BigQuery into a separate three year slot commitment. Total discount realization on the GCP compute and analytics bill went from twenty four percent under the original proposal to forty one percent.
Renewal in twelve months. Audit notice in the inbox. RFP on the desk. We start where you are.
Google Cloud CUD framework signals, Google Cloud commitment framework signals, Google Workspace framework signals, and the broader Google Cloud licensing leverage signals across the practice.