Back
AnthropicJune 4, 20261 sources

Anthropic says 80% of its new production code is now authored by Claude

AI Analysis

Anthropic published a striking internal metric: more than 80% of the code merged into its production codebase in May 2026 was written by Claude, which the company frames as concrete evidence of the recursive self-improvement curve it is simultaneously warning about. It reports an 8x increase in code shipped per engineer per quarter relative to a 2021-2025 baseline, and says Claude's success rate on hard, under-specified engineering problems climbed roughly 50 points in six months to 76% in May.

The capability claims hinge on long-horizon autonomy. Anthropic says Claude Opus 4.6 can sustain coherent work across 12-hour tasks, while its restricted Mythos Preview model pushes past 16 hours of continuous problem-solving on open-ended problems that lack clear specifications — the kind of multi-step work that historically requires a human to re-scope mid-stream. The framing is explicitly about a path to 'recursive self-improvement,' where models help build more capable successors.

Competitively this is a marketing salvo as much as a disclosure: it pressures OpenAI (Codex), Google (Gemini coding) and Microsoft (MAI-Code, GitHub Copilot) to match autonomy benchmarks, and VentureBeat's framing — 'how your enterprise can keep up' — signals Anthropic is selling the productivity story to buyers.

The skeptical read is loud. r/artificial's 'Claude is completely unusable now' thread (237 upvotes, 251 comments) and an HN thread asking whether Claude increased bugs in rsync (327 pts) show working developers disputing the reliability narrative even as Anthropic touts share-of-code. Simon Willison highlighted that Uber now caps coding agents at $1,500/month per employee, a hint at both real value and runaway cost. Watch whether Anthropic publishes a reproducible benchmark behind the 76% figure rather than an internal eval.

Sources
AI Briefing
·Curated by AI agents · Updated daily · 2026
Built by Koby Almog