FinOps dashboard showing AI token spend by team
GenAI / FinOps

AI cost management enterprise playbook.

AI cost is the fastest growing line on the cloud bill. Most of it sits outside FinOps. This playbook shows how to bring AI spend under control without slowing the rollout.

Contact Us GenAI Practice
500+Enterprise clients
$2B+Under advisory
Industry Recognized
500+ Enterprise Clients
$2B+ Under Advisory
11 Vendor Practices
100% Buyer Side Independent

AI cost is doubling every nine to twelve months across most enterprises. Bring it inside FinOps before it becomes the largest shadow line on the cloud bill.

Key takeaways

  • AI cost grows three to four times per year across most enterprises in our portfolio.
  • Almost half of AI spend sits outside FinOps tagging today. Fix that first.
  • Token economics are not intuitive. Prompt design, context length, and caching all change unit cost by an order of magnitude.
  • Negotiated rate cards beat retail pricing by 25 to 60 percent at scale.
  • Model choice is the single biggest cost lever. Use the cheapest model that meets the quality bar.
  • Treat AI like cloud. Tag, chargeback, alert, and cap. Same playbook, new resource type.
  • Set a unit economics target per use case. Then defend it.

AI spend is the fastest growing line on the cloud bill at almost every enterprise we work with. The pattern is familiar. New technology, scattered ownership, no tagging, no chargeback, no caps. The result is a line item growing three to four times per year with no clear owner.

This playbook lays out the four cost levers, the controls that actually hold, and the org model that gets AI spend back under finance ownership without slowing the rollout.

The shape of enterprise AI cost

Where the spend lives

AI cost shows up in three places. Native foundation model APIs. Cloud provider AI services. SaaS tools with embedded AI features. The fastest growing piece is usually the third, because no one is tagging it.

  • Foundation model APIs. OpenAI, Anthropic, Google, AWS Bedrock, Azure OpenAI. Tracked, but rarely tagged at the team level.
  • Embedded AI in SaaS. Copilot, Einstein, Now Assist, and similar. Usually billed as add ons inside the larger SaaS contract.
  • Internal model hosting. Self hosted open source models running on GPUs. Easiest to lose track of when the GPU cost moves to another team.

Unit economics by use case

Cost per use case varies by two orders of magnitude. Customer support deflection looks cheap per ticket. Long context document analysis can cost dollars per call. Without unit metrics, the executive view is a single growing number with no story.

The four cost levers

Four levers explain almost all of the variance in enterprise AI spend.

1. Model choice

The cheapest model that meets the quality bar wins. Most teams default to the largest model. For most use cases, a smaller or distilled model gets 90 percent of the quality at 10 to 20 percent of the cost.

2. Prompt and context design

Token count is cost. Cutting context length by half cuts cost by roughly half. Caching repeated context drops the bill further. Most teams over send context out of habit.

3. Negotiated rate cards

Retail pricing is the wrong starting point at enterprise scale. Negotiated rate cards routinely save 25 to 60 percent. The negotiated rate is also the floor you benchmark every subsequent renewal against.

4. Caps and quotas

Soft caps and quota alerts at the team level stop runaway use cases before they bend the budget. The cap rarely fires. It just shifts behavior.

AI cost reduction levers at typical enterprise scale

Lever Effort to deploy Typical saving Time to value
Model right sizingLow30 to 70%Two to four weeks
Prompt and context trimLow20 to 50%Two to six weeks
Rate card negotiationMedium25 to 60%Eight to twelve weeks
Caching repeated contextMedium20 to 40%Four to eight weeks
Quota and alertingLow10 to 20%Two weeks
FinOps tagging and chargebackMediumIndirect, structuralSix to twelve weeks

The controls that actually hold

Pricing tactics without controls are a one quarter win. The controls keep the discipline through the next launch.

Tagging and attribution

Every model call should carry a team or product tag. Provider native attribution is usually weak. Build a thin proxy layer or use one of the AI gateway tools that already do this.

Chargeback and showback

Showback is the floor. Chargeback is what changes behavior. The first month of chargeback is usually the moment teams discover that the cheap experiment was actually a six figure habit.

Unit economics targets

Each use case gets a target cost per call or per outcome. The product team defends it. Without a unit target, every use case grows toward the budget ceiling.

The cheapest model that meets the quality bar wins. Most teams default to the largest model and find out later that the bar was lower than they thought.

Who owns AI cost

AI cost has the same ownership trap as cloud did. Finance owns the number. No one owns the behavior.

FinOps inherits AI

The cloud FinOps team is the natural home. They already run tagging, chargeback, and unit economics. Adding AI is incremental, not a new function.

Product owns the use case

Product or engineering leadership owns the unit target. Finance reports on it. FinOps automates the alerts. Procurement owns the rate card.

AI council for cross cutting calls

A standing council across finance, security, legal, and engineering decides model selection, rate card refresh, and the use cases that get unbounded spend versus capped spend.

Your 90 day AI FinOps plan

Three months of focused work gets most enterprises from no control to credible control.

Days 0 to 30. Visibility

Tag every model call. Build the dashboard. Identify the top ten teams and use cases. Confirm the rate card on every active contract.

Days 30 to 60. Levers

Switch the obvious model choices. Cut prompt and context bloat. Renegotiate the rate card where the data supports it.

Days 60 to 90. Controls

Stand up chargeback. Set unit targets per use case. Calendar the renewal. Hand the operating model to FinOps.

Suggested reading

What to do next

  1. Pull a single view of every AI contract and add monthly run rate.
  2. Tag every model call at the team or product level.
  3. Identify the top five use cases and their unit economics.
  4. Pick the obvious model switches that drop cost without losing quality.
  5. Renegotiate the rate card where data shows the gap to benchmark.
  6. Stand up showback this quarter, chargeback next quarter.
  7. Set unit targets per use case and review monthly with product leads.
  8. Talk to a buyer side advisor when annual AI spend crosses one million dollars.

Frequently asked questions

How fast is enterprise AI spend really growing?

Three to four times per year is the typical pattern across our portfolio. Some are growing faster. None are flat.

Does FinOps cover AI today?

Rarely. Most FinOps teams have AI on the roadmap but not yet in tagging and chargeback. Bringing it in is the first move.

Are negotiated rate cards really worth it under a million dollars?

Often yes. Even at the lower seven figure level we routinely see 25 percent or more come off retail. Below that it depends on the vendor.

Should we standardize on one foundation model vendor?

Standardize on a primary, qualify a secondary. Single vendor is fragile. Three or four vendors are unmanageable.

How do we charge back AI cost fairly?

Tag at the team and use case level. Charge on actual usage. Publish unit economics monthly. Disputes shrink fast once teams see their own numbers.

Is AI cost a procurement problem or an engineering problem?

Both. Procurement owns the rate card. Engineering owns the unit metric. Finance owns the report. The cleanest org model has all three pulling on AI together.

AI Platform Contract Playbook

The full ai platform contract playbook framework from the GenAI Practice.

GenAI vendor contract red lines, IP indemnity posture, data use clauses, and the buyer side moves across the AI platform stack.

Used across more than five hundred enterprise engagements. Independent. Buyer side. Built for procurement leaders running the next renewal cycle.

No spam. We will only email you about this download. Privacy.
Run the AI vendor lock in risk assessment in under five minutes.
Open the Tool →
3.4x
YoY AI Spend Growth
46%
Outside FinOps
28%
Typical Waste
12 mo
Payback
100%
Buyer Side

AI cost without FinOps is the same mistake we made with cloud in 2014. Different sticker, same lesson.

Morten Andersen
Co Founder, Redress Compliance
Deep Library

More on this topic.

GenAI Practice →
Enterprise contract negotiation table with AI vendor terms on screen
GenAI
Enterprise AI contract negotiation playbook for 2026.
The buyer side moves on red lines, indemnity, data use, and benchmark pricing across the AI stack.
14 min read
Boardroom with governance framework on display
GenAI
Enterprise AI governance playbook 2026.
How to stand up AI governance that procurement, legal, and security all sign off on.
12 min read
Data center racks with privacy overlays
GenAI
AI data governance for enterprise agreements.
Privacy, IP, and training data clauses that decide whether you can ship the rollout.
11 min read
AI knowledge hub illustration
GenAI
GenAI Knowledge Hub.
Every framework, benchmark, and playbook across the GenAI vendor stack.
8 min read
Editorial boardroom interior

The advisor your vendors do not want.

500+ enterprise clients. 11 vendor practices. Industry recognized. One conversation can change what you pay for the next three years.

AI FinOps in your inbox each month.

Monthly briefings on token pricing, model unit economics, and the controls that hold AI spend down.