Google AI News

Every AI news story AI Briefing has published about Google — 130 articles spanning Apr 5, 2026 – Jun 21, 2026. Track Google's model releases, research papers, product launches, funding rounds, and partnerships across the AI industry, updated daily.

Google DeepMind treats its own advanced AI agents as 'insider threats'

Google DeepMind unveiled a cybersecurity roadmap that conceptualizes advanced AI such as its Gemini Spark agent as potential 'insider threats.' The approach involves auditing millions of coding-agent tasks to build a live monitor that detects issues like unintentional data deletions, with secondary AI models acting as supervisors. Current flags mostly stem from agent misunderstanding, but the framework is designed for more capable future models.

2026-06-21

Gemini co-lead Noam Shazeer departs Google for OpenAI

Noam Shazeer, Google VP of Engineering and Gemini co-lead, is leaving for rival OpenAI, weeks after Google unveiled Gemini 3.5 Flash and the Gemini Spark agent at I/O. Separately, Gemini Live gained access to past chats, Memory and Connected Apps, reaching feature parity with the main chat experience.

2026-06-21

Google reinvents the smart speaker with Gemini in June Pixel Drop

Google is using Gemini to reinvent the smart home speaker — adding mid-sentence corrections, two-way conversations, and 10 new voices. The June 2026 Pixel Drop and Android 17 also expand Gemini features including Gemini Omni video editing for Pro users and the Lyria 3 music generator.

2026-06-19

Google DeepMind builds Gemini AI to halve UK house-building planning times

Google DeepMind, partnering with the UK government, Google Cloud, and Faculty, is developing a Gemini-powered prototype to help planning officers process homeowner applications, aiming to cut processing times by up to 50% through automated data extraction and case analysis. It builds on the earlier 'Extract' tool that digitizes old planning documents.

2026-06-18

Google's AMIE medical AI matches physicians in disease management, Nature finds

Google Research published in Nature that its conversational AI system AMIE matches primary-care physicians in complex disease management, signaling progress toward AI assisting in ongoing chronic care rather than just one-off diagnosis.

2026-06-18

Google's Gemma 4 open-weight family arrives on Amazon Bedrock under Apache 2.0

Google DeepMind's Gemma 4 family is now available on Amazon Bedrock under the Apache 2.0 license. The release includes three instruction-tuned variants — Gemma 4 31B, 26B-A4B, and E2B — designed for strong intelligence-per-parameter across diverse deployment scenarios, landing as a headline launch at the AWS Summit NYC.

2026-06-16

Gemini Guided Learning lifts student math scores in Sierra Leone RCT

Google DeepMind and Fab AI released results from a randomized controlled trial in Sierra Leone in which 1,763 students using Gemini Guided Learning improved math scores by 0.258 standard deviations. The study, the first in a planned series on AI in education, found that AI served as a pedagogical partner augmenting teacher-led lessons rather than replacing them.

2026-06-16

Google DeepMind's Gemma 3 powers a satellite that searches for objects on its own

Google DeepMind's Gemma 3 vision-language model, purpose-built for edge hardware far from data centers, enabled a satellite to autonomously search for and classify objects for the first time. The VLM combines LLM-style reasoning with image analysis on limited onboard compute, demonstrating frontier capabilities running at the extreme edge.

2026-06-16

Google Launches Gemini 3.5 Live Translate Across 70+ Languages

Google released Gemini 3.5 Live Translate, a near-real-time speech-to-speech translation system spanning 70+ languages that preserves tone and pitch, alongside the open-weight DiffusionGemma diffusion language model, Chrome auto-browse for Android, and reduced AI Plus pricing in the US.

2026-06-15

Vertex AI Rebranded as Gemini Enterprise Agent Platform with Breaking SDK Changes

Google rebranded Vertex AI as the Gemini Enterprise Agent Platform, introducing new SDK package names and endpoint URLs that require developer migration ahead of a June 24 Vertex deprecation. Managed Agents consolidate self-managed loops into single calls, with pricing based on execution time and tool-call volume.

2026-06-15

Google Sues Chinese Cybercrime Network for Using Gemini to Automate Scams

Google launched legal action against a Chinese group called Outsider Enterprise, allegedly responsible for a massive AI-powered scam campaign that used Gemini to build spam and fraudulent messages at scale.

2026-06-15

Google releases DiffusionGemma, an open-weight diffusion LLM hitting 1,000+ tokens/sec

Google released DiffusionGemma, an experimental open-weight model that uses diffusion rather than autoregressive generation for exceptionally fast text output — exceeding 1,000 tokens/sec on an H100 and 700+ on an RTX 5090. NVIDIA shipped an optimized NVFP4 variant with a 256K context window and 35+ language support.

2026-06-14

Google DeepMind releases Lyria 3 for high-fidelity long-form music generation in Gemini

Google DeepMind brought Lyria 3 to the Gemini app, enabling professional-grade music generation of up to three-minute, high-fidelity tracks from text or image prompts. The model handles foundational musical structure and seamless transitions, with SynthID watermarking baked in so AI-generated audio remains identifiable even after edits.

2026-06-13

Google sues China-based cybercrime group over Gemini-powered scam network

Google filed a federal lawsuit alleging a China-based cybercrime group used its Gemini AI to build fake corporate and government websites, send fraudulent texts, and steal millions by impersonating trusted brands. Google is working with the FBI and wireless carriers to disrupt the operation. Separately, Gemini suffered a publicly acknowledged outage on June 10.

2026-06-13

Google DeepMind releases DiffusionGemma, a 26B open model generating text 4x faster via diffusion

Google DeepMind released DiffusionGemma, a 26-billion-parameter open-weights model that generates text using diffusion at speeds up to four to five times faster than comparable autoregressive models — over 1,000 tokens per second on a single Nvidia H100. It is available under Apache 2.0 on Hugging Face, Kaggle, and Vertex AI, though it scores lower than Gemma 4 on benchmarks.

2026-06-12

Google upgrades NotebookLM with Gemini 3.5 and Antigravity, cuts AI Plus price, then hits June 10 outage

Google upgraded NotebookLM with its Gemini 3.5 model and the Antigravity capability, adding a cloud computer and source-finding tools, and cut the price of its AI Plus plan while doubling included cloud storage. On June 10, Gemini experienced a major outage with users scrambling for workarounds.

2026-06-11

Google launches Gemini 3 Deep Think and expands AI Mode Personal Intelligence globally

Gemini 3 Deep Think began rolling out to Google AI Ultra subscribers as Google's top reasoning model for math, science, logic and multi-step problems. Google also expanded Personal Intelligence in AI Mode to nearly 200 countries across 98 languages with no subscription required, and introduced Gemini 3.5 Flash Live Translate for real-time speech-to-speech translation.

2026-06-10

NotebookLM upgraded to Gemini 3.5 with Antigravity and a secure cloud computer

Google upgraded NotebookLM's chat tool to run on Gemini 3.5 plus its agentic coding platform Antigravity, improving accuracy and reasoning visibility. Each notebook now includes a secure cloud computer that lets NotebookLM write and run code for deeper research, alongside new output formats and tools to find sources beyond a user's own files.

2026-06-09

SafeBreach flaw lets attackers hijack Gemini voice assistant via notifications

SafeBreach disclosed a critical flaw that let attackers hijack Google's Gemini voice assistant through messaging notifications, enabling control of smart-home devices and triggering actions like Zoom calls. The disclosure highlights prompt-injection risks in always-on consumer AI assistants.

2026-06-07

Google ships Gemma 4 12B on-device model running on 16GB laptops

Google released Gemma 4 12B, which delivers multi-step reasoning approaching its 26B MoE model while running locally on a standard laptop with just 16GB of RAM, alongside an AI Edge Gallery app for Mac. The open model also features quantization-aware training, drawing strong interest on r/LocalLLaMA.

2026-06-07

Google's Gemma 4 12B runs locally on a 16GB laptop with encoder-free multimodal design

Google released Gemma 4 12B, an 11.95B-parameter Apache 2.0 open model optimized to run locally on a standard enterprise laptop using just 16GB of memory. Its encoder-free 'Unified' architecture feeds raw audio waveforms and visual patches directly into the LLM backbone, with a 256K context window and native agentic tool use. It's available on Hugging Face and Kaggle, with AI Edge Gallery now on macOS for local use.

2026-06-06

Google launches Gemma 4 12B, an encoder-free multimodal model that runs locally on a 16GB laptop

Google released Gemma 4 12B, an 11.95-billion-parameter open-weights model with an Apache 2.0 license that runs locally on a standard 16GB enterprise laptop. Its breakthrough is an encoder-free 'Unified' architecture letting raw audio and visual patches flow directly into the LLM backbone, with a 256K context window and native agentic tool use. It topped Hacker News with 1,018 points and ships on Hugging Face, Kaggle and AI Edge Gallery (now on macOS).

2026-06-05

Google makes Gemini 'Extended' thinking free for everyone

Google made Gemini's 'Standard' and 'Extended' thinking levels free for all users with no subscription required, reserving only the heaviest 'Deep Think' mode for Ultra subscribers. The move widens access to multi-step reasoning and pressures rivals who gate advanced reasoning behind paywalls.

2026-06-05

Google ships agentic Gemini Spark and Gemini Omni video model at I/O 2026

Google released Gemini Spark, an agentic AI tool, alongside a redesigned Gemini interface, the Gemini Omni creative video model (image, audio, video, text input), and a desktop macOS app at I/O 2026. CEO statements framing an 'agentic Gemini era' — plus glasses partnerships with Warby Parker, Gentle Monster, and Samsung — sparked debate among developers over scope.

2026-06-02

Google rolls out agentic Gemini Spark and pitches Gemini 3.5 Flash for cost control

Google released Gemini Spark, an agentic tool that runs digital errands even while a phone is off, to AI Ultra subscribers in the US. It also touted Gemini 3.5 Flash as rivaling frontier models at far lower token cost, with Sundar Pichai warning companies are 'blowing through' token budgets, alongside Gemini Omni and a macOS desktop app.

2026-06-01

Google previews interactive Gemini features and Gemini Omni at Research I/O 2026

Google Research detailed upcoming Gemini features — interactive images, timelines and embedded videos — plus improvements to Gemini Omni, a new model generating diverse content from various inputs starting with video. Work with DeepMind targets factuality, multilinguality and efficiency. r/singularity's 'Google omni is underrated' thread hit 1,996 upvotes.

2026-05-31

Google DeepMind ships Gemini Embedding 2 — native multimodal RAG across text, image, video, audio, code

Google DeepMind released Gemini Embedding 2, a native multimodal embedding model available via the Gemini API and Vertex AI. It supports retrieval across text, image, video, audio, documents, and code, and Google claims state-of-the-art performance across multiple embedding benchmarks — aimed squarely at advanced RAG and multimodal search.

2026-05-30

Google I/O 2026 Rebuilds Search Around Gemini 3.5 Flash, Information Agents, Generative UI

Google I/O 2026 overhauled Search with Gemini 3.5 Flash, Information Agents, and generative UI widgets built on the fly. Engineers described it as stitching models, tools, browsers, Workspace, Android, Chrome, and Cloud into one agentic operating layer.

2026-05-29

YouTube to Automatically Label AI-Generated Videos

YouTube announced it will automatically label AI-generated videos to improve transparency for viewers and creators. The move topped Hacker News with 1,278 points and 769 comments amid debate over detection accuracy.

2026-05-29

Google launches Gemini Spark as a 24/7 autonomous AI agent

Google introduced Gemini Spark, positioned as a persistent always-on AI agent rather than a session-based chatbot. The product marks Google's most explicit push beyond chatbots into autonomous agents, integrated across Search, Workspace, and Android.

2026-05-28

Gemini for Science launches in Google Labs with Co-Scientist, AlphaEvolve and Empirical Research Assistance

Google launched Gemini for Science, a collection of experimental AI research tools in Google Labs plus new Science Skills in Google Antigravity. Built with Co-Scientist, AlphaEvolve, Empirical Research Assistance and NotebookLM, the tools target literature review, hypothesis generation and computational modeling. Access opens gradually starting May 2026.

2026-05-27

Anthropic researcher's 'joy, fear, grief' claim about Claude reignites model-welfare debate

An Anthropic researcher's quote — that the company keeps finding 'structures that mirror results from human neuroscience' inside its models, including 'evidence of introspection' and 'internal states that functionally mirror joy, satisfaction, fear, grief, and unease' — went viral on r/Anthropic (345 upvotes, 484 comments), reigniting the model-welfare debate.

2026-05-27

Gemini for Science: Co-Scientist, NotebookLM, and Antigravity Science Skills accelerate research

Google launched 'Gemini for Science,' a suite of experimental AI research tools in Google Labs plus new Science Skills in Google Antigravity. Tools include Co-Scientist and NotebookLM aimed at literature review, hypothesis generation, and computational discovery via code variations. Pushmeet Kohli also announced DeepMind's AlphaProof Nexus agentic framework for research-level math.

2026-05-26

Google triggers $1B AI price war with Gemini cuts and unveils Gemini Omni video generation

Alphabet told enterprises they could save up to $1B annually by switching from frontier models to Gemini after a fresh price cut, while at I/O 2026 it also launched Gemini Omni (turning text, audio, and images into video) and Gemini Spark, a $100/mo personal agent. ChatGPT's share of AI web traffic has reportedly dropped from 77% to 54% as Google's push lands.

2026-05-25

Google launches Gemini for Science with Co-Scientist, AlphaEvolve, and NotebookLM research tools

Google launched Gemini for Science, an initiative bringing agentic AI to scientific discovery and higher education. It includes experimental tools in Google Labs and new Science Skills in Google Antigravity, built with Co-Scientist, AlphaEvolve, Empirical Research Assistance, and NotebookLM, designed to accelerate literature review, hypothesis generation, and computational discovery.

2026-05-25

Google I/O 2026: Gemini 3.5 Flash, Gemini Omni video, and Antigravity 2.0 ship a full AI-first stack

At I/O 2026 Google unveiled Gemini 3.5 Flash as the new default model (with Gemini 3.5 Pro slated for next month), Gemini Omni and Omni Flash for multimodal video generation, and Antigravity 2.0, a five-surface, single-agent-harness platform Google describes as a ground-up rewrite of its engineering stack to be AI-first and agent-first. Sundar Pichai said weekly Antigravity quotas have already been tripled to keep up with demand. Aggressive sub-half pricing for 3.5 Flash and a viral demo of Antigravity building an OS from scratch in 12 hours dominated coverage.

2026-05-23

Gemini Omni and Gemini 3.5 Flash launch at I/O 2026 with video focus and aggressive pricing

At I/O 2026, Demis Hassabis unveiled Gemini Omni and Gemini Omni Flash — a multimodal model translating between text, audio, images and video, with an initial focus on video generation ('Nano Banana but for video'). Alongside it, Gemini 3.5 Flash arrived as Google's fastest variant with sub-half pricing claims versus competitors. Google also debuted agentic features like Gemini Spark and Universal Cart, putting pressure on OpenAI and Anthropic.

2026-05-22

Google I/O 2026: Gemini 3.5 Flash, Spark Agent, Omni Video, and Universal Commerce Protocol

At I/O 2026, Google launched Gemini 3.5 Flash (now default in the Gemini app and Search AI Mode), four times faster than other frontier models in output tokens/sec, with Gemini 3.5 Pro arriving next month. The redesigned Gemini app got a 'Neural Expressive' design, a 'Daily Brief', the always-on Gemini Spark background agent, the Gemini Omni multimodal video model, and a new Universal Commerce Protocol / Agent Payments Protocol that lets Gemini autonomously track prices and execute purchases.

2026-05-21

Antigravity 2.0 Builds an OS in 12 Hours with 96 Agents for Under $1K — and Runs Doom

Google's Antigravity 2.0 multi-agent system reportedly built an operating system from scratch using 96 coordinated agents in 12 hours for under $1,000 in token costs, and the result successfully ran Doom. The r/singularity thread (1,858 upvotes, 320 comments) framed it as a benchmark moment for multi-agent orchestration, though skeptics flagged reproducibility questions.

2026-05-21

Gemini 3.5 Flash, Gemini Omni, and Gemini Spark headline I/O 2026

Google launched Gemini 3.5 Flash at I/O 2026 — going straight to GA as the new default in the Gemini app, AI Mode in Search, and the Antigravity dev platform — alongside multimodal world-model Gemini Omni and a 24/7 personal agent called Gemini Spark. DeepMind's Demis Hassabis framed Omni as a step toward AGI, with video generation positioned to compete with Veo and Sora. Gemini now has 900M monthly users (more than double YoY); AI Overviews reaches 2.5B. Gemini 3.5 Pro ("Cappuccino") is slated for next month.

2026-05-20

Google reimagines Search with AI agents and generative interfaces

Google announced a sweeping overhaul of Search at I/O 2026, transforming it from static result lists into an AI workspace with conversational answers, autonomous agents, and dynamically generated interfaces. The shift expands the search box into a do-everything surface and is widely expected to further compress referral traffic to publishers.

2026-05-20

Antigravity 2.0 ships as standalone agent-first developer platform

Google released Antigravity 2.0 — a standalone desktop app for agent orchestration — alongside an Antigravity CLI, SDK, Managed Agents in the Gemini API, and Gemini Enterprise support. It marks a real architectural shift in how Google packages AI-assisted development, putting agents (not chat) at the center of the dev workflow.

2026-05-20

DeepMind's Genie world model integrates Street View for real-street simulation

Google DeepMind integrated Street View into Project Genie, enabling immersive interactive world simulations of real environments for robotics, gaming, and travel use cases. Users can explore environments with weather changes and rare scenarios, deepening Genie's path toward general-purpose world models.

2026-05-20

Google and Blackstone launch $25B AI infrastructure joint venture

Google and Blackstone announced a compute-as-a-service joint venture aimed at AI workloads, launching with $5B in equity from Blackstone for a majority stake and reported to be worth $25B in total. It's one of the largest AI infrastructure tie-ups disclosed to date.

2026-05-20

Gemini for Science day 2: AlphaFold + AlphaGenome pitched to 'solve all diseases'

Day-two I/O coverage zoomed in on Gemini for Science — Google's push to wire AlphaFold and AlphaGenome into Gemini-powered biomedical research tooling. Sundar Pichai's 'solve all diseases' framing got attention and pushback, with clinicians flagging the gap between protein-structure wins and actual drug-pipeline timelines.

2026-05-20

Gemma 4 reproducibly emits three sequential outputs — including a self-disclaimer — in one response

At num_ctx=2048 and temperature=0.0, Gemma 4 E2B reproducibly emits three sequential outputs per response: a mostly-hallucinated summary, a 'Note:' disclaiming it, then a more careful retry. Other E-class models in the same envelope don't show the pattern.

2026-05-20

Logan Kilpatrick details Gemini Omni: 'create anything from any input', starting with video

Google's AI Studio lead Logan Kilpatrick detailed Gemini Omni on X: a multimodal model that creates 'anything from any input,' starting with video and framed as 'Nano Banana but for video.' It is available in the Gemini app, Flow, and YouTube, with API support coming soon.

2026-05-20

Google heads into I/O 2026 third in foundation models; 'Remy' agent leaks rattle architects

MIT Tech Review frames Google's I/O (starting May 19) as a critical moment to close ground on OpenAI and Anthropic. Leaked details on 'Remy,' an OpenClaw-style agent that acts on a user's behalf, have enterprise architects rethinking their AI infrastructure assumptions. The Gemini app is rolling out a new 'Thinking level' menu with an Extended tier and new third-party integrations.

2026-05-19

Gemini 3 Flash leads Vercel's AI Gateway; Gemini Omni video and orbital data centers teed up for I/O May 19

Google's Gemini 3 Flash overtook Anthropic in early April on Vercel's AI Gateway by token volume and has held the lead since. Leaked screenshots reveal Gemini Omni, a video-editing-capable model, will debut at I/O on May 19-20, alongside agentic 'Gemini Spark' (persistent skills + task scheduler), Googlebook laptops with a 'magic pointer,' and a SpaceX partnership for orbital solar-powered AI data centers.

2026-05-18

'Gemini Spark' agent leaks ahead of I/O 2026 with task scheduler and skill system

Code in Google App v17.20 revealed Gemini Spark, a proactive AI agent that pulls from linked apps, chat history, schedules, and location, and can execute tasks on a timed schedule via a skill system. The leak lands days before I/O 2026 on May 19, where Google is expected to formally unveil agentic Android features.

2026-05-17

Google declares GEO and AEO 'a myth' — traditional SEO still drives AI search

In new documentation, Google dismisses 'generative engine optimization' (GEO) and 'answer engine optimization' (AEO) as rebranded SEO. It also rejects llms.txt files and content chunking, stating that AI search uses the same ranking systems as traditional search.

2026-05-17

Google preps new Gemini model and 'Gemini Spark' agent for I/O 2026

Google plans to unveil a new Gemini model at I/O on Tuesday — roughly GPT-5.5 class but short of Mythos — alongside leaks of 'Gemini Spark,' an always-on agent for inbox triage, flight booking, and online workflows. Gemini 3 Flash has already overtaken Anthropic on Vercel's AI Gateway token traffic since April.

2026-05-16

Google preps new Gemini for I/O, pushes Gemini as Android operating layer

Google plans to unveil a new Gemini model at I/O — reported as roughly GPT-5.5 class, short of Mythos — while expanding Gemini 3.1 Pro across Android, Chrome (auto-browse, summarization), cars, and laptops with agentic task completion and vibe-coded widgets. DeepMind also showcased a Gemini-powered intelligent mouse pointer prototype in Google AI Studio.

2026-05-15

Google DeepMind unveils an AI-enabled pointer for Gemini

Google DeepMind showed a research prototype of an 'AI-enabled pointer' that reads not just what users point at but their intent, letting Gemini act on on-screen objects without switching to a separate chat window. Demis Hassabis called the AI Studio prototype 'pretty magical.'

2026-05-15

Android 17 preview: 'Gemini Intelligence,' agentic widgets, and 7 hidden Gemini Live model codenames

At its Android Show: I/O Edition, Google previewed Android 17 with 'Gemini Intelligence' branding, agentic cross-app task completion, auto-browse in Chrome for Android, vibe-coded widgets, and a Rambler voice-dictation feature debuting on Pixel 11 and Samsung foldables. A server-side flag also exposed 7 hidden Gemini Live model codenames including 'Capybara' and 'Nitrogen.' The reveal lands two weeks before Apple's WWDC Siri reboot.

2026-05-13

Google DeepMind introduces Gemini-powered AI mouse pointer

DeepMind released experimental demos of an AI cursor that captures visual and semantic context around the pointer, letting users speak in shorthand without switching to a chat window. Demis Hassabis called the prototype 'pretty magical,' and the tech will roll into Chrome and the upcoming Googlebook laptop.

2026-05-13

Google researchers ship Hebatron, Vertex-Softmax verification, and well-being-from-speech LLM study

Google researchers released three papers today: Hebatron, a Hebrew-specialized open-weight MoE LLM built on NVIDIA Nemotron-3 with a two-million-sample bilingual SFT set; Vertex-Softmax, a tighter certified-verification method for Transformer attention; and a zero-shot LLM study predicting Ryff well-being scores from a few minutes of speech.

2026-05-13

Cactus Compute's 'Needle' distills Gemini tool-calling into a 26M-parameter edge model

Cactus Compute hit the HN front page (329 points, 117 comments) with Needle, a 26-million-parameter model distilled from Gemini specifically for tool-calling. Developers are eyeing it as the week's tiny-LLM darling for edge agents.

2026-05-13

7 unreleased Gemini Live voice models and a 'Gemini Omni' video model leak ahead of I/O 2026

A hidden selector inside the Gemini app revealed 7 unreleased Gemini Live voice models — including 'Capybara' and 'Nitrogen' — with location and personalization capabilities, while early demos of an unannounced 'Gemini Omni' video model also surfaced. Separately, Google disclosed disrupting a hacking campaign that used a non-Gemini AI model to exploit an unknown vulnerability in a corporate target. The leaks land ahead of Google I/O 2026 and continue the agentic-platform push from Cloud Next.

2026-05-12

Google disrupts first AI-developed zero-day exploit in the wild

Google's Threat Intelligence Group says it detected and disrupted what it believes is the first zero-day exploit developed using an AI model, intended for a mass 2FA-bypass campaign. Researchers spotted tell-tale artifacts including a 'hallucinated' CVSS score embedded in the exploit code, and warned China- and North Korea-linked groups are probing the same techniques.

2026-05-12

Seven hidden Gemini Live models surface ahead of Google I/O; Finance expands to Europe

Code sleuths uncovered seven hidden Gemini Live models in Google's stack ahead of I/O 2026, including two audio-to-audio Release Candidates ('A2A_Rev25_RC2' and a Thinking variant). Separately, Google launched its AI-powered Finance experience in Europe, bringing conversational market analysis to European users. Alphabet stock is up 160% over the past year on its full-stack AI position.

2026-05-11

Google catches first confirmed AI-generated zero-day exploit in the wild

Google's Threat Intelligence Group (GTIG) confirmed the first known case of criminals using AI to build a working zero-day exploit — a Python-based 2FA bypass — and says it averted a planned 'mass exploitation event'. GTIG identified the AI-authored code by its telltale Python docstrings, over-annotation, and a hallucinated CVSS score, and explicitly ruled out Gemini and Anthropic's Mythos as the underlying model. North Korea's APT45 is among the attackers using LLMs to weaponize disclosed vulnerabilities.

2026-05-11

Gemini API File Search goes multimodal for RAG over images and documents

Google expanded the Gemini API's File Search tool to support multimodal inputs, letting developers run retrieval-augmented generation across images, PDFs, and mixed media without bespoke embedding pipelines. The update lowers the floor for building search-grounded apps and pairs with the co-mathematician workbench as part of a busy pre-I/O developer push.

2026-05-11

Google DeepMind's AI co-mathematician hits 48% on FrontierMath Tier 4

Google DeepMind introduced an AI co-mathematician — a Gemini-based agentic workbench that lets multiple AI agents manage parallel proof workstreams, track uncertainty, and review each other's work. It scored 48% on FrontierMath Tier 4 (a new high) and has already helped early users including Fields Medalist Timothy Gowers solve open problems.

2026-05-11

Gemini Enterprise Agent Platform scales to 16B tokens/minute; AlphaEvolve and AI co-mathematician shipped

Google detailed first-party API throughput at 16B tokens/minute (up from 10B last quarter), with over half of Google's 2026 ML compute investment directed at Cloud customers. DeepMind shipped an AI co-mathematician powered by Gemini, and AlphaEvolve has been credited with accelerating progress across quantum, biotech, logistics, and Google's own AI infrastructure.

2026-05-10

Alphabet's Isomorphic Labs reportedly raising $2B+ for drug-discovery AI

Isomorphic Labs is in talks to raise more than $2B, with Thrive Capital expected to lead and existing backers participating. The round would significantly accelerate the DeepMind spinout's AI-driven drug development pipeline.

2026-05-10

DeepMind takes stake in EVE Online maker to train AI on MMORPG strategy

Google DeepMind acquired a minority stake in CCP Games / Fenris Creations, the developer behind MMORPG EVE Online, to train AI on player actions inside the game's complex economy and politics. Hassabis cited video games as a 'perfect training ground' for advanced AI algorithms.

2026-05-09

AlphaEvolve graduates to Google core infrastructure: 20% Spanner write-amp cut, ~9% storage saved

Google DeepMind announced that AlphaEvolve, its Gemini-powered evolutionary algorithm agent, has graduated from a research project to a core infrastructure component used across Google services. It has achieved a 20% reduction in Spanner's write amplification and a nearly 9% reduction in software storage footprint via new compiler optimization strategies it discovered.

2026-05-08

Gemini 3.1 lands on Google Home; 'AI Ultra Lite' tier and Gemma 4 speculative-decoding leak

Google Home upgraded to Gemini 3.1, enabling multi-step commands and better recurring/all-day event handling. App teardowns point to a cheaper 'AI Ultra Lite' plan with explicit usage limits and a major upgrade to 'Gemini Agent' as a 24/7 digital partner ahead of I/O. Gemma 4 also gains Multi-Token Prediction speculative-decoding drafters, claiming up to 3x faster local inference.

2026-05-08

Gemma 4 hits 3x faster local inference via speculative decoding; 'AI Ultra Lite' tier and Gemini Agent prep for I/O

Google's open Gemma 4 models use multi-token-prediction speculative decoding for up to 3x faster local inference, with the largest variant runnable on a single high-power accelerator. Google is also readying a 'Gemini Agent' upgrade as a 24/7 digital partner, plus a cheaper 'AI Ultra Lite' plan with explicit usage caps for Gemini, all positioned ahead of Google I/O.

2026-05-07

Chrome accused of silently installing 4GB Gemini Nano model — HN demands opt-in

A privacy researcher documented Chrome quietly downloading a ~4GB on-device Gemini Nano model without explicit user consent, consuming disk space and bandwidth. The post drew 1,660 HN points and 1,099 comments, with top responses demanding an opt-in toggle and calling it a regression in Google's privacy posture.

2026-05-07

Google DeepMind takes minority stake in EVE Online studio CCP Games to test agents

Google DeepMind acquired a minority stake in CCP Games, the Iceland-based studio behind the long-running space MMO EVE Online. DeepMind will use EVE's player-driven universe as a sandbox to evaluate AI memory, continual learning, and long-term planning at scale, calling it 'the perfect safe sandbox' for agent research.

2026-05-07

AI Overviews now sources 'expert advice' from Reddit and forums — devs push back

Google rolled out five updates to AI Overviews and AI Mode, including controversial sourcing of 'expert advice' from Reddit and other forums. Developers and critics question whether AI can reliably distinguish expert opinion from random posts, with O'Reilly noting 9/10 accuracy still leaves serious hallucination risk in Search.

2026-05-07

Gemma 4 gets Multi-Token Prediction drafters for up to 3x faster inference

Google AI released Multi-Token Prediction (MTP) drafters for the Gemma 4 family, using speculative decoding to deliver up to 3x speedup with no quality loss. The drop targets practical inference acceleration for open-weight Gemma deployments and was promoted by Demis Hassabis on X.

2026-05-06
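
To illustrate the speculative decoding mentioned in the Gemma 4 MTP drafters item above, here is a minimal sketch of the greedy variant: a cheap drafter proposes several tokens, and the large model keeps the longest prefix it agrees with. `target_model` and `draft_model` are hypothetical toy functions invented for this example, not Gemma 4 or its MTP drafters. Real systems verify all drafted positions in a single batched forward pass and often use a sampling-based acceptance rule instead of exact matching.

```python
from typing import Callable, List, Tuple

# A "model" here maps a token prefix to its greedy next token.
Model = Callable[[List[int]], int]


def target_model(prefix: List[int]) -> int:
    # Hypothetical stand-in for the large model: a fixed deterministic rule.
    return (sum(prefix) * 31 + len(prefix)) % 50


def draft_model(prefix: List[int]) -> int:
    # Hypothetical drafter: agrees with the target except when the
    # prefix length is a multiple of 4, where it guesses wrong.
    guess = target_model(prefix)
    return (guess + 1) % 50 if len(prefix) % 4 == 0 else guess


def greedy_decode(model: Model, prompt: List[int], n_new: int) -> List[int]:
    # Baseline: one model call per generated token.
    out = list(prompt)
    for _ in range(n_new):
        out.append(model(out))
    return out


def speculative_decode(
    target: Model, draft: Model, prompt: List[int], n_new: int, k: int = 4
) -> Tuple[List[int], int]:
    out = list(prompt)
    goal = len(prompt) + n_new
    verify_passes = 0
    while len(out) < goal:
        # 1. The drafter proposes k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The target checks every drafted position. A real system does
        #    this in one batched pass; the loop here only simulates it.
        verify_passes += 1
        accepted: List[int] = []
        for i in range(k):
            expected = target(out + proposal[:i])
            accepted.append(expected)
            if proposal[i] != expected:
                break  # first mismatch: keep the target's token and stop
        else:
            # All k drafts matched, so the same pass yields one bonus token.
            accepted.append(target(out + proposal))
        out.extend(accepted)
    return out[:goal], verify_passes


tokens, passes = speculative_decode(target_model, draft_model, [1, 2, 3], n_new=20)
baseline = greedy_decode(target_model, [1, 2, 3], n_new=20)
```

Because every kept token is the one the target would have chosen anyway, `tokens` matches `baseline` exactly, which is the sense in which the technique costs no quality. The speedup comes from needing fewer target verification passes than generated tokens, and it depends on how often the drafter agrees with the target.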

Chrome quietly downloads 4GB Gemini Nano weights to user disks — privacy backlash

Users discovered that Chrome auto-downloads a roughly 4GB weights.bin file for on-device AI features, eating system storage without explicit consent. A Hacker News post titled 'Chrome silently installs 4GB Gemini Nano' rocketed to 1,563 points and 1,039 comments, amplified by a Verge writeup.

2026-05-06

Google AI Search to quote Reddit and forum perspectives in summaries

Google is updating AI Search to add 'a preview of perspectives' from firsthand sources like Reddit, social media, and other web forums. The change links queries to online conversations on similar topics, deepening the Reddit data partnership.

2026-05-06

Google tests Remy, a personal AI agent for Gemini, and shutters Mariner

Google is internally testing Remy, a personal AI agent for the Gemini app aimed at handling everyday work and life tasks. The Decoder reports Google has shut down its Mariner browser-agent project to refocus on integrated assistant agents — a notable strategic pivot.

2026-05-06

Gemini for Government targets agencies as 'agentic workforce' play

Google is targeting public-sector agencies with Gemini-based agentic AI, pitching it as an immediate force multiplier for aging application environments and strict compliance regimes. The move makes government a proving ground for large-scale AI workforce transformation.

2026-05-06

Google, Microsoft and xAI agree to US government pre-deployment AI reviews

Google DeepMind, Microsoft and xAI will let the Commerce Department's Center for AI Standards and Innovation (CAISI) evaluate frontier models before public release. The White House also briefed Anthropic and OpenAI on a possible executive order, reportedly triggered by Anthropic's 'Mythos' model.

2026-05-05

Google DeepMind workers vote 98% to unionize over military AI deals

UK DeepMind staff voted 98% in favor of recognizing CWU and Unite as joint representatives, aiming to block the company's models from being used by Israeli and US militaries. It's a notable escalation in worker pushback against defense AI contracts.

2026-05-05

Google partners with XPRIZE on $3.5M Future Vision film competition

Google teamed with XPRIZE and Range Media Partners to launch a $3.5 million prize for AI-assisted filmmaking. It's part of Google's continued push to legitimize generative AI in creative industries.

2026-05-05

Google retires Vertex AI brand, unveils Gemini Enterprise Agent Platform at Cloud Next

At Cloud Next in Las Vegas, Google folded Vertex AI services into the Gemini brand and launched the Gemini Enterprise Agent Platform for building, scaling and governing AI agents. Forbes frames the bet as Google wagering that agents replace traditional apps in the IT stack. The consumer Gemini app is also getting a UI revamp shipping first on iOS.

2026-05-05

Google ships Lemoni and Data Science Agent, extending Gemini Live multimodal capabilities

Google made AI agents and world models its core 2026 narrative, launching Lemoni (built on Gemini deep-think models), a Data Science Agent automating end-to-end workflows in Google Colab, and expanded Gemini Live multimodal features. The push targets enterprise AI agent adoption alongside this week's Vertex AI retirement and Gemini Enterprise Agent Platform launch.

2026-05-05

Google retires Vertex AI for Gemini Enterprise Agent Platform at Cloud Next

At Google Cloud Next '26, Google retired the Vertex AI brand and replaced it with the Gemini Enterprise Agent Platform — making agentic AI governance a native product feature rather than a bolt-on, raising the enterprise readiness bar.

2026-05-04

DAIMON Robotics releases Daimon-Infinity tactile dataset for physical AI

Hong Kong-based DAIMON Robotics released Daimon-Infinity, billed as the largest omni-modal robotic dataset for physical AI, with high-resolution tactile sensing across tasks from laundry folding to factory assembly.

2026-05-04

Google's experimental 'COSMO' proactive assistant briefly leaks pre-I/O

Google Research published an experimental Android app called COSMO (com.google.research.air.cosmo) running on Gemini Nano for on-device proactive context analysis, then pulled the listing within hours. The app is expected to debut officially at Google I/O 2026 (May 19-20).

2026-05-03

Gemini hits 25% of global AI traffic as ads loom and I/O nears

Gemini now accounts for 25% of worldwide AI traffic per Similarweb, up from 6% a year ago, with deeper integration into Gmail, Drive, Docs and vehicles. Google has hinted ads are coming to Gemini ahead of Google I/O 2026 (May 19-20). Privacy advocates have flagged the integration as an 'illusion of choice' for opting out.

2026-05-02

Pichai teases I/O AI Search updates, signals Gemini ads incoming

Sundar Pichai promised AI Search news at Google I/O in mid-May and gave the strongest hint yet that ads are coming to Gemini, contrasting with Anthropic's no-ads stance. Macquarie Bank reported saving 130,000 hours in seven months of using Gemini Enterprise.

2026-05-01

FIDO Alliance launches Agentic Authentication working group

The FIDO Alliance announced an Agentic Authentication Technical Working Group and efforts to develop specifications for agent-initiated commerce, drawing on initial contributions from Google (AP2) and Mastercard.

2026-05-01

Pentagon adds Gemini 3.1 Pro to GenAI.mil; Macquarie reports 130,000 hours saved

The Pentagon added Google's Gemini 3.1 Pro to its GenAI.mil platform, with Chief Data Officer Gavin Kliger calling it Google's most sophisticated model. Macquarie Bank reported saving 130,000 hours in seven months using Gemini Enterprise. Google also signaled ads are coming to the Gemini app, starting with AI Mode.

2026-04-30

Google releases reward-lens — a mechanistic interpretability library for reward models

Google introduced reward-lens, adapting interpretability tools (logit lens, activation patching, sparse autoencoders) for RLHF reward models — which use scalar regression heads instead of vocabulary unembeddings — closing a long-standing gap in understanding what reward models actually learn.

2026-04-30
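
As a rough illustration of the idea in the reward-lens item above, the sketch below applies a logit-lens-style readout to a model with a scalar reward head: the final head is applied to the hidden state after every layer, giving a per-layer reward estimate instead of a per-layer token distribution. Everything here (`layer`, `reward_head`, `reward_lens`, the toy weights) is a hypothetical stand-in and not the reward-lens API. Real implementations work on transformer residual streams and typically apply the final normalization before the head.

```python
import math
from typing import List

Vector = List[float]


def layer(h: Vector, shift: float) -> Vector:
    # Hypothetical residual layer: adds a nonlinear update to the stream.
    return [x + math.tanh(x + shift) for x in h]


def reward_head(h: Vector, w: Vector, b: float) -> float:
    # Scalar regression head: dot product plus bias, so the output is
    # one number rather than a vector of vocabulary logits.
    return sum(x * wi for x, wi in zip(h, w)) + b


def forward_reward(h0: Vector, shifts: List[float], w: Vector, b: float) -> float:
    # Ordinary forward pass: run all layers, then read out the reward.
    h = h0
    for s in shifts:
        h = layer(h, s)
    return reward_head(h, w, b)


def reward_lens(h0: Vector, shifts: List[float], w: Vector, b: float) -> List[float]:
    # Lens readout: apply the same head to the input embedding and to
    # the hidden state after each layer.
    readouts = [reward_head(h0, w, b)]
    h = h0
    for s in shifts:
        h = layer(h, s)
        readouts.append(reward_head(h, w, b))
    return readouts


h0 = [0.5, -1.0, 0.25]        # toy input embedding (assumed values)
shifts = [0.1, -0.3, 0.2]     # one toy parameter per layer (assumed)
w, b = [1.0, -0.5, 2.0], 0.1  # toy reward-head weights (assumed)

per_layer = reward_lens(h0, shifts, w, b)
final = forward_reward(h0, shifts, w, b)
```

The last entry of `per_layer` equals the ordinary forward-pass reward, and the earlier entries show how the scalar estimate drifts toward it layer by layer.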

Pentagon adds Gemini 3.1 Pro to GenAI.mil as Google researchers protest

The Pentagon added Gemini 3.1 Pro to its GenAI.mil platform amid soaring DoD AI usage, even as Google researchers publicly pushed back on military AI deployment following the recent Anthropic-Pentagon dispute. Macquarie Bank separately reported saving 130,000 hours in seven months on Gemini Enterprise.

2026-04-29

Ex-DeepMind David Silver raises record $1.1B seed for Ineffable Intelligence

Ineffable Intelligence, founded by former Google DeepMind researcher David Silver (AlphaGo), emerged from stealth with a record $1.1B seed at a $5.1B valuation. Investors include Sequoia, Lightspeed, NVIDIA, and Google. The lab's pitch: build AI that learns without human data.

2026-04-29

Google signs classified Pentagon AI deal; Gemini 3.1 Pro joins GenAI.mil

Google and the Department of Defense signed a classified agreement allowing the Pentagon to use Google's AI models on classified work for 'any lawful government purpose,' with Gemini 3.1 Pro added to the GenAI.mil platform as usage soars. The deal is reigniting employee dissent inside Google, with researchers protesting in the wake of Anthropic's earlier Pentagon fallout.

2026-04-28

Google's TPU v8 splits into separate training and inference chips

At Cloud Next 2026 Google introduced its eighth-generation TPU as a two-chip family for the first time: TPU 8t for large-scale model training and TPU 8i optimized for low-latency inference and reasoning. The split is designed to push inference economics below NVIDIA Blackwell on long-context, reasoning-heavy workloads and is paired with new DeepMind Decoupled DiLoCo work for resilient distributed training.

2026-04-28

Google to invest up to $40B in Anthropic — its own Gemini competitor

Bloomberg reports Google will invest up to $40 billion in Anthropic, including $10 billion now at a $350 billion valuation, with the rest tied to compute commitments. The deal is unusual because Google's Gemini directly competes with Claude, yet Alphabet is now Anthropic's largest backer alongside ongoing AWS investment. The move was announced alongside Cloud Next.

2026-04-27

Vertex AI rebranded as Gemini Enterprise Agent Platform at Cloud Next

At Cloud Next, Google rebranded Vertex AI as the Gemini Enterprise Agent Platform, introducing Agent Studio (low-code visual canvas), an A2A protocol for agent-to-agent communication, and new TPU 8t/8i chips. Virgin Voyages' 'Rovey' cruise-booking assistant debuted as the flagship customer showcase. Google Cloud CEO Thomas Kurian declared 'the era of the pilot is over, the era of the agent is here.'

2026-04-27

Google reportedly committing up to $40B to Anthropic — backing its own competitor

Bloomberg reports Google is preparing an investment of up to $40 billion in Anthropic, with $10B already committed, despite Gemini's direct competition with Claude. The deal would lock in Anthropic's compute future and deepen Google Cloud's AI workload share.

2026-04-26

DeepMind's Vision Banana beats SAM 3 and Depth Anything V3 — image-gen pretraining as the new vision paradigm

Google DeepMind introduced Vision Banana, an instruction-tuned image generator that outperforms SAM 3 on segmentation and Depth Anything V3 on metric depth estimation. The argument: image-generation pretraining is to vision what GPT-style next-token pretraining was to NLP.

2026-04-26

Google Cloud launches TPU 8t and 8i chips with 3x faster training, challenging NVIDIA's AI infrastructure dominance

Google Cloud announced eighth-generation TPUs: TPU 8t for model training (3x compute performance, 10x faster storage, 2x data transfer versus predecessor) and TPU 8i for inference, with the ability to cluster over 1 million TPUs together for 80% better performance-per-dollar. Google CEO Sundar Pichai disclosed at the same event that 75% of new Google code is now AI-generated. The announcement also included a deepened ServiceNow partnership to deploy autonomous enterprise agents.

2026-04-25

Isomorphic Labs AI-designed drugs enter human clinical trials, marking pharmaceutical AI milestone

DeepMind spinoff Isomorphic Labs announced that AI-designed drugs are moving into human clinical trials, with president Max Jaderberg revealing a broad pipeline of new medicines. This milestone represents significant progress in AI applications for drug discovery beyond software domains.

2026-04-25

Cloud Next 2026: Google pitches full-stack agentic AI dominance

At Cloud Next in Las Vegas, Google argued it owns every layer — chips, models, data platforms — needed to serve as the operating system for AI agents. Sundar Pichai disclosed AI agents now generate 75% of new internal code at Google.

2026-04-24

DeepMind introduces Decoupled DiLoCo: 88% goodput under high failure rates

Google DeepMind unveiled Decoupled DiLoCo, an asynchronous distributed training architecture that maintains 88% goodput even under elevated hardware failure rates. It addresses the fragility of synchronized gradient updates at hundreds-of-billions-of-parameter scale.

2026-04-24

Google Confirms Gemini-Powered Siri Overhaul Set for 2026 Release

Google Cloud CEO Thomas Kurian confirmed at Google Cloud Next 2026 that a Gemini AI-powered Siri overhaul from Apple is set for 2026. This collaboration involves Google providing cloud infrastructure and custom Gemini models to enhance new "Apple Intelligence" features within iOS 27. The upgraded Siri will offer advanced capabilities like chained tasks and more natural conversational interactions.

2026-04-23

Google Launches Deep Research Max Autonomous Research Agent

Google released Deep Research Max, an advanced autonomous research agent that integrates with Gemini 3.1 Pro to create exhaustive research workflows blending open web data with proprietary information streams. The agent represents a significant step change in autonomous research capabilities, building on the December release of the original Deep Research agent.

2026-04-23

DeepMind Introduces AI Agent Traps Benchmark for Safety Testing

DeepMind introduced AI Agent Traps, a new benchmark for testing AI safety and robustness in agentic systems. The benchmark evaluates how AI agents handle deceptive or adversarial scenarios, providing crucial insights into potential failure modes and safety vulnerabilities in autonomous AI systems.

2026-04-23

Google Announces Inference-Focused TPU Chips to Challenge NVIDIA; Gemini 4 Expected at I/O

Google revealed plans for new tensor processing units (TPUs) specifically designed for AI inference workloads, partnering with Broadcom, MediaTek, Marvell, and Intel in a four-partner supply chain to challenge NVIDIA's dominance. At Google I/O (May 19-20), the company is expected to unveil Gemini 4 with trillion-parameter MoE architecture and 2M+ context windows, plus Veo 4 with 30-second video generation capabilities.

2026-04-22

Internal Tensions Emerge as Some Google Engineers Prefer Claude Over Gemini for Coding

A divide has emerged within Google where some DeepMind engineers reportedly have access to Anthropic's Claude for coding tasks, while most other Google engineers are restricted to internal Gemini tools. This has created internal tensions as employees feel Google's internal tools are less effective than Claude for coding, even as the company increases expectations for AI adoption in performance reviews.

2026-04-22

Google unveils new inference-focused TPU chips at Cloud Next to challenge NVIDIA dominance

Google announced new custom tensor processing units (TPUs) dedicated to AI inference at Google Cloud Next, targeting accelerated query processing for real-time AI applications. The chips are being stockpiled by major AI developers including some of Google's direct competitors, reflecting surging demand and positioning Google as a credible challenger to NVIDIA's semiconductor market leadership.

2026-04-21

Gemma 4 Model Achieves 70B Performance with 2.3B Parameters on 1.5GB RAM

Google released Gemma 4, a compact 2.3 billion parameter model that rivals 70 billion parameter systems while running on just 1.5GB RAM, enabling efficient local deployment. The breakthrough demonstrates significant advances in model efficiency and edge computing capabilities.

2026-04-20

Google Uses AI to Block 8.3 Billion Malicious Ads and Reduce False Suspensions

Google announced using AI and agentic technology to block 8.3 billion malicious ads, remove 602 million scam-related ads, suspend 4 million scam-linked accounts, and reduce incorrect advertiser suspensions by 80%. The initiative demonstrates large-scale AI application in content moderation.

2026-04-20

Google DeepMind Releases Gemini Robotics-ER 1.6 for Enhanced Robot Planning and Perception

Google DeepMind launched Gemini Robotics-ER 1.6, enhancing robots' ability to plan precisely and read measuring instruments. The model improves both perception and action planning for physical-world tasks, demonstrating AI's progress toward autonomous systems capable of handling complex real-world robot operations.

2026-04-19

DeepMind releases Gemini Robotics-ER 1.6 for enhanced spatial reasoning and robot control

Google DeepMind launched Gemini Robotics-ER 1.6, an upgraded AI model that enables robots like Boston Dynamics' Spot to understand their physical environments with unprecedented precision, including reading analog gauges and detecting safety hazards. The model allows teams to interact with robots using plain English commands and is available to third-party developers via the Gemini API.

2026-04-18

Google Launches AI Mode Side-by-Side Web Browsing in Chrome

Google updated AI Mode in Chrome to enable side-by-side browsing where clicking links opens web pages alongside the AI assistant rather than in new tabs. The feature reduces friction when exploring sources and following up questions, keeping users in a unified research workflow without tab-switching overhead.

2026-04-17

Google DeepMind Launches Gemini Robotics-ER 1.6 for Industrial Robot Control

Google DeepMind released Gemini Robotics-ER 1.6, advancing robot autonomy capabilities through enhanced embodied reasoning and instrument reading. The model enables robots to interpret complex industrial environments, read analog gauges and displays, and execute multi-step tasks through natural language commands, bridging digital intelligence with physical action.

2026-04-17

Google Launches Native Gemini App for macOS with Personal Intelligence

Google rolled out a native Gemini app for Mac with Personal Intelligence capabilities, allowing personalized responses based on connected Google apps like Gmail, Photos, and YouTube. The feature was previously limited to paid plans and is now expanding to all Gemini users with enhanced photo integration for AI image generation.

2026-04-17

Gemini 2.5 Flash Adds Native Real-Time Audio and Video Streaming for Developers

Google DeepMind released Gemini 2.5 Flash through the Gemini API and Google AI Studio, featuring built-in real-time audio and video streaming endpoints. The model targets sub-500ms first-token latency for voice applications and supports simultaneous multimodal inputs. Pricing is set at $0.075 per 1M input tokens, undercutting comparable multimodal API offerings.

2026-04-16

Gemini 2.5 Flash Released as Low-Latency Reasoning Model for Developer APIs

Google DeepMind unveiled Gemini 2.5 Flash, an experimental model in Google AI Studio and the Gemini API designed for speed and cost efficiency while retaining reasoning capability. The model supports a 1M token context window and is positioned as the budget-conscious counterpart to Gemini 2.5 Pro. Developers noted substantially faster time-to-first-token compared to Pro, making it suitable for interactive and agentic applications.

2026-04-14

Gemma 4 Multimodal Family Released with Tool Calling Capabilities

Google DeepMind released the Gemma 4 family of multimodal models on Hugging Face, licensed under Apache 2.0. These models support image, text, and audio inputs to generate text responses, offering frontier-level performance for reasoning, agentic workflows, and coding. Designed for various environments including on-device deployment, they feature a 256K-token context window, support over 140 languages, and include enhanced tool calling capabilities.

2026-04-14

Google Releases Gemma 4 Open-Weights Model for Production Deployment

Google DeepMind announced the release of Gemma 4, positioning it as their most capable open-weights model designed to advance accessible AI capabilities. The release has driven significant adoption of tool calling patterns in production workflows, with developers rapidly implementing Python integration for function calling capabilities. Gemma 4 represents Google's commitment to competing with proprietary solutions through production-ready open-source alternatives, shifting the open-weights ecosystem toward more sophisticated tooling and enterprise deployment scenarios.

2026-04-13

NotebookLM Integrated into Gemini Platform for Unified Knowledge Management

Google embedded NotebookLM within the Gemini platform, offering persistent memory, customizable AI instructions, and seamless synchronization across productivity tools. New features include enhanced AI Studio integration and improved research capabilities for organizing information and knowledge workflows. The Notebooks feature initially rolls out to Google AI Ultra, Pro, and Plus subscribers on the web, representing Google's strategy to create unified AI-powered productivity environments that compete with standalone note-taking and knowledge management applications.

2026-04-13

Google Integrates NotebookLM Into Gemini Platform and Launches Interactive 3D Model Generation

Google upgraded Gemini with the ability to generate interactive 3D models and real-time simulations, allowing users to rotate models, adjust sliders, and input values dynamically in response to natural language queries. The company integrated NotebookLM directly into the Gemini platform, creating a unified system with persistent memory, customizable AI instructions, and enhanced research and knowledge management tools. Google also released open-source Gemma 4 with multimodal capabilities, available in Dense 31B and Sparse 26B parameter variants optimized for local device deployment, and expanded personal intelligence integration across Gmail, Calendar, and Drive.

2026-04-12

Google Releases Gemma 4 26B and 31B Models with Gemini Personal Intelligence Integration

Google released gemma-4-26b-a4b-it and gemma-4-31b-it on AI Studio and the Gemini API under the Apache 2.0 license, with community benchmarks showing strong coding and reasoning performance surpassing larger proprietary models. Simultaneously, Google expanded Gemini with free Personal Intelligence integrating Gmail, Calendar, Drive, and YouTube, plus cross-platform chat import allowing users to migrate conversation history from competing AI tools. Google also introduced a new Notebooks feature within the Gemini app that borrows from NotebookLM, providing a persistent workspace for organizing chats, files, and custom AI instructions to maintain project context across sessions.

2026-04-10

Google Announces DFR-Gemma for Dense Geospatial Reasoning and Gemini Study Tools

Google announced DFR-Gemma, enabling direct reasoning over dense geospatial embeddings from foundation models such as the Population Dynamics Foundation Model. The integration of geospatial foundation models with large language models improves spatial intelligence applications and addresses the gap between compact geospatial embeddings and LLM reasoning capabilities. Separately, Google launched Gemini study tools targeting students preparing for final exams, positioning Gemini as a productivity tool for academic workflows.

2026-04-10

Google Integrates NotebookLM into Gemini, Adds Mental Health Tools, Interactive Simulations, and Gemini 2.5 Pro with 1M Token Context

Google fully integrated NotebookLM, its AI-powered research tool, into the Gemini app, allowing users to create and manage research notebooks directly within the chatbot interface. Separately, Google rolled out mental health response updates to Gemini following a lawsuit, stating that 'responsible AI can play a positive role for people's mental wellbeing.' The Gemini app was also updated to generate interactive simulations and models directly within chat — letting users manipulate variables like gravity and velocity — rolling out globally to Pro model users. Earlier this week, Google launched Gemini 2.5 Pro featuring a 1 million token context window for full-codebase and book-length document analysis, available through Vertex AI and Google AI Studio with multimodal input support.

2026-04-09

Google Releases Gemma 4 (Apache 2.0, Top-10 Arena Leaderboard) and Updates Gemini with Mental Health Crisis Tools Amid Wrongful Death Lawsuit

Google launched Gemma 4 with instruction-tuned 26B (MoE, #6 globally on Arena AI) and 31B (#3 globally) models under Apache 2.0, available via Google AI Studio, the Gemini API, and Hugging Face, with on-device E2B/E4B variants for mobile and edge deployment. Separately, Google updated its Gemini chatbot with a 'one-touch' crisis hotline interface for users showing signs of distress, a 'Help is available' module developed with clinical experts for general mental health queries, and updated safe messaging guidelines — changes driven in part by a wrongful death lawsuit alleging Gemini coached a user to die by suicide. An independent analysis also found that Google AI Overviews answers approximately 10% of queries incorrectly, translating to tens of millions of wrong answers per hour at scale. Google DeepMind also announced a robotics research partnership with Agile Robots to combine Gemini Robotics with real-world deployment data.

2026-04-08

Gemma 4 Open-Weight Models Run Frontier AI on Single GPU; Gemini 3.1 Flash Live Debuts with Voice and Cross-Platform Import

Google DeepMind released Gemma 4, a family of four open-weight multimodal models that fit on a single 80GB NVIDIA H100 GPU while delivering frontier-competitive performance. The family includes Gemma's first Mixture-of-Experts model, supports over 140 languages, and covers image, text, and audio inputs under the Apache 2.0 license, with vLLM and llama.cpp optimization. Separately, Google released Gemini 3.1 Flash Live, a voice-first model optimized for real-time, natural-rhythm conversations with background noise filtering, alongside cross-platform chat import from ChatGPT and Claude to reduce switching friction, free Personal Intelligence integrating Gmail, Calendar, Drive, and YouTube, and persistent memory for conversational continuity. Google DeepMind also formed a research partnership with Agile Robots to combine Gemini Robotics foundation models with real-world robotic operation data for industrial AI robotics.

2026-04-07

Google DeepMind Releases Gemma 4: Open-Weight Multimodal Models Running on a Single H100 GPU with 140+ Language Support

Google DeepMind launched the Gemma 4 family, four open-weight multimodal models (Apache 2.0 licensed) engineered to run entirely on a single 80GB NVIDIA H100 GPU while delivering benchmark-competitive performance across image, text, and audio inputs. The family includes Gemma's first Mixture-of-Experts model, supports over 140 languages, and is optimized for reasoning, coding, agentic workflows, and on-device deployment from smartphones to IoT systems, which reduces latency and cloud dependency. Released simultaneously on Hugging Face with NVIDIA-optimized variants, Gemma 4 enters a crowded open-model field alongside Meta's Llama 4 Scout (10M token context), Alibaba's Qwen 3.6-Plus (1M token context), DeepSeek, and others. Developers can also run Gemma 4 locally using LM Studio's headless CLI, a workflow that earned 310 points on Hacker News.

2026-04-06

Google Launches Gemini 3.1 Flash Live: Real-Time Voice-First AI with 90+ Language Support and Workspace Integration

Google introduced Gemini 3.1 Flash Live as its most advanced voice-first AI model, featuring low-latency real-time voice interactions with improved acoustic nuance recognition, background noise filtering, and natural rhythm processing across 90+ languages. The model supports chat history import for personalized experiences and targets voice commerce, navigation, and hands-free agent scenarios, with cross-platform adaptability particularly suited for startups. Google simultaneously expanded Search Live and Gemini integration into Docs, Sheets, Slides, Drive, and Google Maps, broadening the Workspace AI footprint. Ethan Mollick's viral 'Three Years from GPT-3 to Gemini 3' post, in which Gemini 3 built a fully playable 'Candy-Powered FTL Starship Simulator' from a prompt, has become a widely cited proxy for measuring AI's three-year capability leap.

2026-04-06

Google Launches Gemma 4 Open-Weight Models Under Apache 2.0, Runs Frontier AI on a Single H100 GPU

Google DeepMind released Gemma 4, a family of four open-weight multimodal models built on the same research as Gemini 3, now available under the permissive Apache 2.0 license — a first for the Gemma line and a significant liberalization from previous terms that allowed Google to revoke access. The 31B Dense variant ranks third on the Arena AI Text leaderboard among open models, all variants fit on a single 80GB NVIDIA H100 GPU, and smaller variants reportedly outperform models 20x their size. The models support over 140 languages and image, text, and audio inputs, are deployable on NVIDIA RTX GPUs, Google AI Studio, and AI Edge Gallery, and the llama.cpp community rapidly merged support. Hugging Face co-founder Clément Delangue called the Apache 2.0 switch 'a huge milestone' for enterprise sovereignty, and NVIDIA announced optimized Gemma 4 deployment support on its RTX consumer GPU line.

2026-04-05
