xAI's Colossus 2 becomes world's first operational 1-million-GPU supercluster

Colossus 2 represents roughly a 5x jump over the largest previously disclosed AI training system, a scale that drew awe across developer communities. The cluster spans two sites — Memphis and Atlanta — drawing a combined 1.4 gigawatts and relying on 140 kW-per-rack liquid cooling to densely pack GB300 silicon. xAI frames the buildout as the substrate for Grok 5, whose pre-training is already running with a target completion in August 2026.
The announcement crystallizes the week's dominant theme: compute as competitive moat. While Anthropic and OpenAI race to public markets to fund capacity, and Amazon raises $17.5B in debt for capex, xAI is making the rawest bet — more GPUs, more power, sooner. The power-draw figures alone (1.4 GW) put Colossus 2 in the range of a mid-sized nuclear plant's output, intensifying scrutiny of AI's energy footprint.
The open question is whether scale converts to capability: Grok's reputation has been dented by the Canadian deepfake ruling and ongoing trust concerns, and a million GPUs guarantees compute, not a better model. Observers read an August Grok 5 as a serious competitive milestone, but only benchmarks against Fable 5, GPT-5.5, and Gemini will show whether the spend pays off.