NVIDIAJune 7, 20263 sources

NVIDIA unveils RTX Spark, putting 128GB unified memory and 120B local models on Windows PCs

AI Analysis

NVIDIA used its Taipei keynote (Computex/GTC Taipei 2026, June 1) to unveil RTX Spark, a Blackwell-based superchip pairing up to 6,144 CUDA cores with 128GB of unified memory and roughly one petaflop of compute, aimed squarely at personal AI PCs. Jensen Huang's pitch: run agentic AI workloads — including a 120-billion-parameter model — locally on Windows laptops and desktops, no cloud round-trip required.

The 128GB unified memory is the architectural unlock; it's what lets a single consumer-class machine hold a 120B model in memory, a capability previously reserved for datacenter or high-end Mac configurations. Breakdowns on Medium framed it as a genuine turning point for on-device development, while skeptics on HN (257 pts, 449 comments) debated whether the 'beast' specs translate to real-world throughput.

Competitively, RTX Spark is a direct shot at Apple's unified-memory M-series advantage and AMD's AI PC push, leveraging NVIDIA's software dominance (CUDA, the expanding Nemotron coalition) to bring high-end AI compute to the mainstream Windows base. It also dovetails with the local-first theme dominating r/LocalLLaMA this week — Gemma 4 QAT, homelab GPU rigs, daemon-free Rust toolkits.

Caveats: pricing, thermals, and sustained performance under real agentic loads are unproven, and 'can run a 120B model' says nothing about usable tokens-per-second. What to watch: independent benchmarks, OEM laptop availability, and whether local inference at this scale dents cloud-inference demand for prosumer workloads.

Sources

dev.to

https://dev.to/eli_9c82b7dfe52c1bc371ffe/nvidias-blackwell-chip-arrives-on-windows-pcs-challenging-apple-and-amd-c06

cnet.com

https://www.cnet.com/videos/spark/

youtube.com

https://www.youtube.com/watch?v=H4nJo-oqAro