Gemini 3 Deep Think Upgrade Sets New Reasoning Benchmarks

Category: Analysis

AI Research Contributor

13 February 2026

Google DeepMind has upgraded its specialized reasoning mode, Gemini 3 Deep Think, and is making it available in the Gemini app and through an API early-access program on Vertex AI. The upgrade is designed to support complex tasks in science, research, and engineering. In the Gemini app, Deep Think is available to Google AI Ultra subscribers, while developers and researchers can apply for access to the API program.

According to Google DeepMind, Deep Think achieves state-of-the-art results on several benchmarks: 84.6% on ARC-AGI-2 (a test of logical reasoning), 48.4% on Humanity’s Last Exam (challenging problems in mathematics, science, and engineering), and an Elo rating of 3,455 on the competitive programming platform Codeforces.

Benchmark	Deep Think	Claude Opus 4.6	GPT-5.2	Gemini 3 Pro Preview
ARC-AGI-2	84.6%	68.8%	52.9%	31.1%
Humanity’s Last Exam	48.4%	40.0%	34.5%	37.5%
Codeforces (Elo)	3,455	2,352	–	2,512
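
To make the size of the reported lead concrete, the following is a minimal sketch that computes Deep Think's margin over the next-best model in each row. The scores are copied from the table above; the function name `lead_over_runner_up` and the use of `None` for the entry shown as a dash are illustrative assumptions, not part of Google DeepMind's reporting.

```python
# Scores copied from the table above. None stands in for the entry
# the table shows as a dash (an assumption about how to encode it).
scores = {
    "ARC-AGI-2 (%)": {
        "Deep Think": 84.6, "Claude Opus 4.6": 68.8,
        "GPT-5.2": 52.9, "Gemini 3 Pro Preview": 31.1,
    },
    "Humanity's Last Exam (%)": {
        "Deep Think": 48.4, "Claude Opus 4.6": 40.0,
        "GPT-5.2": 34.5, "Gemini 3 Pro Preview": 37.5,
    },
    "Codeforces (Elo)": {
        "Deep Think": 3455, "Claude Opus 4.6": 2352,
        "GPT-5.2": None, "Gemini 3 Pro Preview": 2512,
    },
}


def lead_over_runner_up(row, model="Deep Think"):
    """Return (runner-up name, margin) for `model` within one table row."""
    # Ignore the model itself and any entry without a reported score.
    others = {name: s for name, s in row.items()
              if name != model and s is not None}
    runner_up = max(others, key=others.get)
    # Round to one decimal to avoid floating-point noise in the difference.
    return runner_up, round(row[model] - others[runner_up], 1)


margins = {bench: lead_over_runner_up(row) for bench, row in scores.items()}

for bench, (runner_up, margin) in margins.items():
    print(f"{bench}: Deep Think leads {runner_up} by {margin}")
```

The margins are in each benchmark's own units (percentage points for the first two rows, Elo points for Codeforces), so they are not comparable across rows.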

In addition, the model achieved gold-medal-level performance at the 2025 Physics and Chemistry Olympiads. Google DeepMind has also shared examples of Deep Think being used in scientific research.

Daniel Mercer

AI Research Contributor

Daniel Mercer is an AI research contributor specializing in large language models, benchmarking, and multimodal systems. He writes about model capabilities, limitations, and real-world performance across leading AI assistants and platforms.
