Soaring AI bills push businesses toward cheaper open-source and Chinese models

Reuters reports that the surge in AI costs is fundamentally reshaping how businesses choose models, pushing them toward open-source and cheaper Chinese alternatives. The four most popular models on OpenRouter are now all Chinese, with DeepSeek holding the top spot — a striking inversion of the US-model dominance of a year ago.
The economics are stark: Chinese models charge as little as 18 cents per million tokens versus roughly $4 on average for US models, a more than 20x gap for routine tasks. Gartner adds a forward warning that AI-coding tool costs could exceed a developer's salary by 2028 if current trajectories hold, pressuring teams toward model routing, caching, and cheaper backends.
This is the connective theme tying together much of the week's news: Meta cut off from Google's Gemini for over-consumption, Anthropic and Amazon clashing over token-based billing, Uber reportedly burning its 2026 AI budget in four months, and Apple passing chip costs to consumers. Cost — not just capability — is now the dominant decision variable.
For US frontier labs, the combination of high prices and government-gated access is actively ceding share to open Chinese models. Skeptics caution about data-governance, compliance, and geopolitical risks of routing production workloads through Chinese models, and note that the cheapest token isn't always the cheapest outcome once quality and reliability are priced in. What to watch: whether US vendors cut prices in response, and how enterprises balance cost against compliance.