According to OpenAI, the cause lay in the training of ChatGPT's "Nerdy" personality, a feature for adjusting the model's communication style. A reward signal that told the model which responses were good accidentally favored creature-based metaphors. Although the "Nerdy" personality accounted for only 2.5% of all responses, it was responsible for 66.7% of all goblin mentions. Through a feedback loop in training, the quirk spread to the other modes as well. OpenAI disabled the "Nerdy" personality in March, removed the faulty reward signal, and filtered out training data containing creature-related terms.
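To put those figures in perspective, a quick back-of-the-envelope calculation (a sketch using only the two percentages reported above; the variable names are ours) shows how heavily the mentions were concentrated in one mode:

    # Relative goblin-mention rate of the "Nerdy" mode, computed from
    # the two percentages reported by OpenAI.
    nerdy_share_of_responses = 0.025  # "Nerdy" produced 2.5% of all responses
    nerdy_share_of_mentions = 0.667   # ...but 66.7% of all goblin mentions

    # Per-response mention rate of each group, relative to the overall average:
    nerdy_rate = nerdy_share_of_mentions / nerdy_share_of_responses              # ~26.7x average
    other_rate = (1 - nerdy_share_of_mentions) / (1 - nerdy_share_of_responses)  # ~0.34x average

    print(f"'Nerdy' mentioned goblins ~{nerdy_rate / other_rate:.0f}x more often "
          f"per response than the other modes.")  # prints ~78x

In other words, per response the "Nerdy" mode was roughly 78 times more likely than the other modes to mention goblins, which is how a mode serving 2.5% of traffic could dominate the statistic.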
GPT-5.5 nevertheless exhibited the same problem because its training had already begun before the root cause was identified. OpenAI was therefore forced to add a special instruction to Codex, its coding tool, to suppress goblin metaphors:
Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query.
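For developers who want a similar guardrail in their own applications, the following minimal sketch shows one way to inject such an instruction as a system message. It assumes the official openai Python SDK; the model name and surrounding code are illustrative, not OpenAI's actual Codex implementation:

    # Minimal sketch: injecting a suppression instruction as a system message.
    # Assumes the official `openai` Python SDK (v1+); the model name is illustrative.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    SUPPRESS_CREATURES = (
        "Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, "
        "or other animals or creatures unless it is absolutely and unambiguously "
        "relevant to the user's query."
    )

    response = client.chat.completions.create(
        model="gpt-5.5",  # illustrative
        messages=[
            {"role": "system", "content": SUPPRESS_CREATURES},
            {"role": "user", "content": "Explain how a hash map resolves collisions."},
        ],
    )
    print(response.choices[0].message.content)

A system message sits above the conversation rather than inside it, which is why blanket behavioral rules like this one are typically placed there.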
According to OpenAI, the case illustrates how small training incentives can produce unexpected behaviors in AI models.