Google launches Gemma 4 open AI models for reasoning and agent workflows

Details: By Daniel Mercer; Category: Models; 1 m; 05 April 2026; 143

Google has introduced Gemma 4, a new family of open AI models designed for advanced reasoning and agent-based workflows.

Gemma 4 AI model performance chart showing benchmark results, multimodal capabilities, and different model sizes

“Gemma 4 are our most intelligent open models to date. They deliver an unprecedented level of intelligence per parameter,” the company said.

We just released Gemma 4 — our most intelligent open models to date. Х

Since the launch of the first generation, developers have downloaded Gemma more than 400 million times, creating over 100,000 model variants within the Gemmaverse ecosystem. The latest version is built on the same research and technologies as the Gemini 3 chatbot.

Different sizes

The Gemma 4 family includes four variants: Effective 2B (E2B), Effective 4B (E4B), 26B Mixture of Experts (MoE), and 31B Dense.

The compact E2B and E4B models, with 2.3 billion and 4.5 billion active parameters, focus on multimodality, low latency, and seamless integration. They can run on smartphones or standard laptops.

The larger 26B MoE and flagship 31B models (with 26 billion and 31 billion parameters) require high-end GPU accelerators such as the Nvidia H100 with 80 GB of memory. These versions are optimized for researchers and developers.

The larger models show strong benchmark performance. In the global ranking of open text models Arena AI, the flagship 31B ranks third, while the 26B model takes sixth place. According to Google, the new lineup outperforms competing models that are up to 20 times larger.

Key capabilities

One of the main advantages of Gemma 4 is its advanced reasoning ability. The models can build complex logic chains and plan multi-step tasks. They show significant progress in math benchmarks and follow instructions with high precision.

Other features include:

Agent workflows: built-in support for function calling, structured JSON outputs, and system instructions enables the creation of autonomous assistants that interact with tools and APIs;
Code generation: Gemma 4 supports high-quality code generation in offline mode, effectively turning a workstation into a local AI assistant;
Vision and audio: all models can process video and images at variable resolutions, recognize text, and analyze diagrams. E2B and E4B also support speech recognition and understanding;
Extended context window: compact models support up to 128,000 tokens, while larger versions handle up to 256,000 tokens, enabling full repository or large document processing in a single query;
Multilingual support: the model family supports more than 140 languages.

Gemma 4 is already available in Google AI Studio and the Google AI Edge Gallery. Integration is also supported by popular third-party tools and frameworks, including Hugging Face, vLLM, llama.cpp, MLX, Ollama, NVIDIA NIM, and LM Studio.

The models can be fine-tuned via Google Colab, Vertex AI, or local GPUs. For production use, deployment is available on Google Cloud, including Cloud Run, GKE, and Sovereign Cloud.

About The Hosts

Daniel Mercer

AI Research Contributor

Daniel Mercer is an AI research contributor specializing in large language models, benchmarking, and multimodal systems. He writes about model capabilities, limitations, and real-world performance across leading AI assistants and platforms.

AI News

Accenture Tracks AI Tool Usage and Ties Adoption to Promotions

Adobe Firefly Introduces Unlimited AI Image and Video Generation for Subscribers

Adobe Unveils CX Enterprise AI Agent Platform as It Searches for a New CEO

AGI May Arrive by 2026–2027, Warns Anthropic CEO Dario Amodei

AI & Society

AI Agents Create a Lobster Religion on Moltbook

AI Boom Drives Cybersecurity Hiring Despite Tech Sector Layoffs

AI Could Trigger a Major U.S. Economic Crisis by 2028, Citrini Research Warns

AI Is Increasing Workload Instead of Reducing It, ActivTrak Study Finds

AI Insights

Adobe Reinvents Document Work with Acrobat Studio and AI

AI agents could disrupt ads and reshape internet commerce

AI as a Role Model for Generation Alpha: Promise, Risks, and the Future of Childhood

AI as a Toy: Why Humanity Always Misuses New Technology First

Google launches Gemma 4 open AI models for reasoning and agent workflows

Different sizes

About The Hosts

More From Daniel Mercer

Models

OpenAI Updates ChatGPT and Plans to Retire Older Models

Industry

DeepSeek Builds Code Harness to Rival Claude Code and Codex

Models

Cursor Releases Composer 2.5 AI Coding Model Based on Kimi K2.5

Platforms

Meta Launches Incognito Chat for Private AI Conversations

Culture

Richard Dawkins Spent Two Days Trying to Prove Claude Isn't Conscious — and Changed His Mind

Platforms

OpenAI, NVIDIA, AMD, Microsoft, Intel, and Broadcom Unveil MRC — New Networking Protocol for AI Supercomputers

Industry

Anthropic Launches 10 Pre-Built AI Agents for Finance — Taking on OpenAI for Enterprise Clients

Models

OpenAI Replaces ChatGPT's Default Model with GPT-5.5 Instant — 52.5% Fewer Hallucinations and New Memory Sources

Health

Google DeepMind Tests AI Co-Clinician for Doctor-Supervised Patient Care

Models

Why GPT-5.1 Became Obsessed With Goblins: The Quirky Training Bug That Spread Across OpenAI's Models

Categories

AI News

Categories

AI & Society

Categories

AI Insights

Google launches Gemma 4 open AI models for reasoning and agent workflows

Different sizes

About The Hosts

More From Daniel Mercer