The day's most important AI news: breakthroughs, releases, funding, and policy — curated for developers, founders, and investors.
Today's Stories
Anthropic overtakes OpenAI at $965B, files confidential draft S-1 for IPO
Vendor: Anthropic
Anthropic's Series H round — led by Altimeter, Dragoneer, Greenoaks and Sequoia, with Amazon committing up to $25B — pushed it to a $965B post-money valuation, surpassing OpenAI as the most valuable AI startup. The company confidentially submitted a draft Form S-1 to the SEC under Section 135 of the Securities Act, giving it the option to pursue an IPO pending review. Annualized revenue reportedly jumped from $9B to over $30B, driven largely by Claude Code.
Anthropic ships Claude Opus 4.8 with 1M-token context, 41 days after 4.7
Vendor: Anthropic
Anthropic released Claude Opus 4.8, an upgraded model with a 1M-token context window, stronger coding and agentic performance, and claimed 4x better honesty over GPT-5.5 — arriving just 41 days after Opus 4.7. The model became new SOTA on ARC-AGI-3 (scoring 1.5% at ~$10K) and is positioned around 'dynamic workflows' orchestrating large numbers of agents.
OpenAI offers UK banks GPT-5.5 Cyber after Anthropic blocks Claude Mythos
Vendor: OpenAI
OpenAI offered nine major UK banks access to its GPT-5.5 Cyber security tool — which finds hidden weaknesses in digital systems — after rival Anthropic blocked the same banks from previews of Claude Mythos. The move turns the OpenAI–Anthropic rivalry into a direct contest over enterprise cybersecurity, against a backdrop of regulator unease.
OpenAI revives Robotics division to build infrastructure robots
Vendor: OpenAI
OpenAI announced it is hiring for a revived Robotics division — its strongest robotics push since shutting the team in 2021 — focused initially on robots that help build data centers, power grids and other critical infrastructure. Sam Altman and Greg Brockman are personally backing the effort.
OpenAI upgrades GPT-5.5 Instant, retires legacy ChatGPT models
Vendor: OpenAI
OpenAI is rolling out an update to GPT-5.5 Instant in ChatGPT and via API, improving response style for more natural, concise conversations and moving writing/coding directly into chat (removing canvas). It is concurrently retiring older models including o3 (August 26, 2026) and GPT-4.5 (June 27, 2026).
AWS brings OpenAI GPT-5.5, GPT-5.4 and Codex to general availability on Bedrock
Vendor: AWS
OpenAI's GPT-5.5, GPT-5.4 and Codex are now generally available on Amazon Bedrock, giving enterprises frontier-model access inside existing AWS governance — IAM, KMS encryption, VPC, and CloudTrail audit logging. Codex shifts to pay-per-token billing without seat licenses, and AWS added Bedrock Managed Agents powered by OpenAI in preview. Anthropic's Claude Opus 4.8 also landed on Bedrock the same week.
AWS open-sources ExtendDB, a DynamoDB-compatible database backed by PostgreSQL
Vendor: AWS
AWS open-sourced ExtendDB, a Rust implementation of the DynamoDB wire protocol that stores data in PostgreSQL while supporting the full API surface — offering a path to DynamoDB compatibility without AWS lock-in.
Anthropic expands Project Glasswing to ~150 organizations, releases Claude Security
Vendor: Anthropic
Anthropic is expanding Claude Mythos Preview access to roughly 150 new organizations across 15+ countries, including critical-infrastructure operators, after Glasswing partners identified over 10,000 high- or critical-severity flaws. It also released Claude Security, using Claude Opus 4.8 to scan codebases and suggest patches.
Google makes Gemini 3.5 Flash the default model in Search AI Mode globally
Vendor: Google
Google upgraded Search's AI Mode to run Gemini 3.5 Flash as its new default model worldwide, targeting frontier agent and coding performance, and expanded Personal Intelligence in AI Mode to nearly 200 countries and 98 languages, with generative-UI capabilities arriving this summer at no cost.
NVIDIA launches Cosmos 3, an open omnimodel framed as robotics' 'ChatGPT moment'
Vendor: NVIDIA
NVIDIA released Cosmos 3, billed as the first fully open omnimodel for physical AI, combining world generation, physical reasoning and action generation in a single Mixture-of-Transformers architecture. Available on Hugging Face with Super and Nano variants plus Diffusers integration, it targets robotics, autonomous vehicles and smart spaces.
NVIDIA unveils RTX Spark superchip and Vera CPU at Computex
Vendor: NVIDIA
Jensen Huang announced the RTX Spark superchip — pairing a Blackwell RTX GPU (6,144 CUDA cores, one petaflop) with a custom 20-core Grace CPU built via MediaTek over NVLink and 128GB unified memory — plus the Vera CPU for data-center agentic AI, expected Q3 2026. Satya Nadella tied RTX Spark to delivering 'unmetered intelligence' on Windows.
NVIDIA unveils Nemotron 3 Ultra, a 550B-parameter open model for AI agents
Vendor: NVIDIA
NVIDIA launched Nemotron 3 Ultra, a 550-billion-parameter mixture-of-experts open model built for long-running autonomous agents across coding, research and enterprise workflows, claiming up to 5x faster inference and 30% lower running costs, with weights due on Hugging Face, ModelScope and OpenRouter around June 4.
Microsoft's Suleyman unveils seven in-house MAI models including MAI-Thinking-1
Vendor: Azure
Microsoft AI chief Mustafa Suleyman announced seven new in-house MAI models, led by reasoning model MAI-Thinking-1 and coding model MAI-Code-1-Flash, plus Frontier Tuning capabilities — a clear signal Microsoft is building beyond its OpenAI dependency. The models drew heavy Hacker News attention.
Microsoft heads to Build with Copilot 'super app' and Copilot Health
Vendor: Azure
Microsoft is set to reveal at Build a Copilot 'super app' merging its various Copilot assistants into a single interface, alongside Windows improvements and Copilot Health — which connects wearables and wellness apps starting with Apple Health plus records from 50,000+ US provider organizations. GitHub Copilot's new usage-based billing drew developer backlash.
Alibaba releases Qwen 3.7 Plus, a low-cost multimodal GUI agent
Vendor: Alibaba
Alibaba's Qwen team released Qwen 3.7 Plus, a multimodal model with vision input adding deep reasoning, self-programming, tool invocation and autonomous iteration, positioned as a roughly $0.40 GUI agent for high-volume enterprise workloads. It ranks 16th globally on the Vision Arena leaderboard.
WWDC 2026 to showcase Siri overhaul with third-party chatbot integration
Vendor: Apple
Ahead of WWDC on June 8, leaks point to iOS 27 featuring a redesigned Siri inside the Dynamic Island with a 'Search or Ask' prompt and the ability to route queries to third-party chatbots like Claude, Gemini and ChatGPT. Apple faces pressure to reset its AI narrative after iPhones slipped from the top of US buyer satisfaction.
Meta integrates Llama 3.1 405B into WhatsApp, doubles down on open-source
Vendor: Meta
Meta is emphasizing its open-source Llama strategy, integrating the Llama 3.1 405B model into WhatsApp to provide free generative AI to global users — a play to commoditize foundation models and let enterprises build proprietary architectures on Meta's free infrastructure.
xAI launches Grok Composer 2.5 inside Grok Build for agentic coding
Vendor: xAI
xAI released Composer 2.5, an agentic coding model available within Grok Build for SuperGrok and X Premium+ subscribers, designed for long-running tasks and complex instruction-following. Built on Moonshot's Kimi K2.5, it offers competitive pricing and a faster variant versus Grok Build 0.1.
MiniMax releases open-weights M3 model, plans China IPO
Vendor: Hugging Face
MiniMax released MiniMax M3, described as the first open-weights model combining three frontier capabilities including coding and agentic use with 1M context, and reported strong growth plus plans for a Mainland China listing. It also failed to dismiss a US lawsuit from Disney, Universal and Warner Bros Discovery over alleged IP theft.