OpenAI’s GPT-5.2 Pro has achieved a new record on the highly challenging FrontierMath benchmark, according to tests conducted by Epoch AI. The model scored 31% on the hardest Tier 4 level, a significant leap from the previous best result of 19% set by Gemini 3 Pro. Due to API issues, Epoch AI tested the model manually via the ChatGPT web interface.

GPT-5.2 Pro’s performance clearly surpassed its closest competitors: Gemini 3 Pro (19%) and GPT-5.2 xhigh (17%). Source: Epoch AI

GPT-5.2 Pro solved 15 of the 48 Tier 4 problems, including four that no previous model had managed to solve. Several professional mathematicians reviewed the solutions and largely praised their quality, though some criticized occasional imprecision in the reasoning.

The results reinforce recent positive reports about advanced AI models, particularly GPT-5 Thinking and GPT-5 Pro, as powerful tools for solving complex mathematical problems. According to some accounts, GPT-5 has even solved Erdős problems autonomously, while in other cases it acted as an advanced assistant.