본문으로 건너뛰기
All news

The Tokenmaxxing Era Is Over: Enterprise AI Shifts from Volume to Value

Summary: A sweeping market correction is underway in enterprise AI: the era of "tokenmaxxing" — pushing developers to use frontier models without cost restraint — is giving way to hard budget caps and a mass migration toward cheaper open-source alternatives.

Key Points

  • Combined token share for OpenAI, Anthropic, and Google fell from 72% in June 2025 to 33% in June 2026, with ~40 percentage points shifting to open-source and Chinese models in a single year
  • Uber blew through its entire annual AI developer budget in four months; the company has since capped per-developer AI tool spend at $1,500/month
  • Lindy's CEO migrated 100% of company traffic from Anthropic Claude to DeepSeek, citing millions in projected annual savings
  • DeepSeek V4-Pro is priced at $0.44/M input tokens; GLM-5.2 at $1.40/M — both undercutting frontier models by 80–90%
  • Menlo Ventures data: OpenAI's enterprise LLM spend share fell from 50% (2023) to 27% (2026); Anthropic rose to 40% but now faces the same cost pressure

Why It Matters

Both OpenAI and Anthropic are preparing IPO filings — exactly the wrong time for a pricing war. Enterprise customers are demanding ROI proof and tighter cost controls just as investors are scrutinizing margin trajectories. If the Big 3 cut prices to match open-source, they compress margins ahead of public listings; if they hold prices, they risk further market share erosion. The structural advantage open-source models now carry — self-hostable, no data routing to external servers, near-zero marginal cost — is not a temporary discount. It is a permanent shift in the economics of enterprise AI.

Further Reading

뉴스레터 구독

무료 뉴스레터

매주 핵심 AI 소식, 한 번에 받기

쏟아지는 AI·LLM 뉴스 중 꼭 알아야 할 것만 골라 메일로 보내드려요. 뉴스레터 발송이 시작되면 구독자분들께 가장 먼저 보내드립니다.