Claude Opus 4.8 launches with 1M-token context, Dynamic Workflows and a ~67% price cut
Claude Opus 4.8 is Anthropic's headline technical release, described as a 'more effective collaborator' with significant improvements in agentic coding, multidisciplinary reasoning, computer use, knowledge work and financial analysis. The model adds a 1M-token context window, a fast mode that runs 2.5x faster and roughly three times cheaper, and an output-token price cut reported at about 67% — landing near a flat $5 input / $25 output per million tokens. Anthropic also claims roughly 4x better 'honesty,' with the model flagging uncertainty rather than confabulating.
The standout feature is Dynamic Workflows in Claude Code, which lets the model coordinate parallel subagents on larger tasks, plus new effort-control settings in Claude.ai and Cowork that let users dial adaptive thinking up or down. Claude Code lead Boris Cherny and colleagues highlighted the /effort control as a way to trade latency for depth. AWS's Swami Sivasubramanian noted stronger self-verification 'great for spec-driven workflows' as Opus 4.8 landed in Kiro IDE.
Competitively, Opus 4.8 squares off against GPT-5.5 and Google's forthcoming Gemini 3.5 Pro, with the price cut clearly aimed at DeepSeek's intelligence-per-dollar pressure. But the launch was polarizing: r/Anthropic's 'Opus 4.8 nerfed??' thread (862 upvotes) and an r/OpenAI post claiming it 'craps itself in SimpleBench' show real-world regressions some users report despite benchmark gains.
The key caveat from builders: 'price ≠ cost.' Hacker News commenters warned that headline per-token cuts don't tame usage-based bills, citing Copilot users whose costs jumped to thousands per month. The cheaper fast mode could either democratize agentic workloads or accelerate the runaway-spend problem enterprises are now actively reining in.