OpenAI’s new image model is official. ChatGPT Images 2.0 is based on the new GPT Image 2 model and, according to a company blog post, brings the same capability as Google’s Nano Banana Pro: the model “thinks” before generating, for a shorter or longer time depending on the selected mode, and can even search the internet while doing so. This is intended to enable greater diversity and accuracy in generated images. Expanded outputs with Thinking, however, are only available to ChatGPT Plus, Pro, and Business users.

With Thinking mode enabled, ChatGPT Images 2.0 can generate up to eight images at once from a single prompt. Characters, objects, and styles are supposed to remain consistent across all scenes. OpenAI cites manga pages, series of social media graphics, and design plans for different rooms in a house as example use cases.

ChatGPT Images 2.0 is a step change in detailed instruction following, placing and relating objects accurately, and rendering dense text, with the ability to generate across aspect ratios.

Better image quality for all users

Regardless of Thinking mode, all ChatGPT users receive improvements in image quality. According to OpenAI, the generator is better at capturing the “distinctive characteristics of photographs” and shows progress in pixel art, manga, film stills, and other image types. The model is also said to handle fine-grained elements that previous image models regularly struggled with: small text, iconography, UI elements, dense compositions, and subtle stylistic instructions.

Support for aspect ratios ranges from 3:1 (ultra-wide) to 1:3 (ultra-tall), covering formats from banners and presentation slides to mobile screens. Resolution goes up to 2K in the API.

API pricing: token-based and quality-dependent

Through the API, developers can integrate the model into their own products under the name gpt-image-2. OpenAI charges on a token basis: $8 per one million image input tokens and $30 per one million image output tokens. Text tokens cost $5 per million input tokens and $10 per million output tokens. Cached inputs are significantly cheaper.
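To make the token-based billing concrete, here is a small sketch that estimates the dollar cost of a request from its token counts. The per-million-token rates are the ones quoted above; the cached-input discount is not modeled, and the example token counts are purely hypothetical.

```python
# Published per-million-token rates for gpt-image-2 (from the pricing above).
# Cached-input discounts are not modeled in this sketch.
RATES = {
    "image_input": 8.00,
    "image_output": 30.00,
    "text_input": 5.00,
    "text_output": 10.00,
}

def estimate_cost(image_in=0, image_out=0, text_in=0, text_out=0):
    """Return the estimated USD cost of one API request from its token counts."""
    tokens = {
        "image_input": image_in,
        "image_output": image_out,
        "text_input": text_in,
        "text_output": text_out,
    }
    return sum(RATES[kind] * count / 1_000_000 for kind, count in tokens.items())

# Hypothetical request: a 100-token text prompt producing 7,000 image output tokens.
print(round(estimate_cost(text_in=100, image_out=7_000), 4))
```

Since billing is purely per token, the cost of an image scales with how many output tokens it consumes, which is why quality and resolution dominate the per-image prices below.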

In practice, the cost per image depends heavily on quality and resolution. According to OpenAI’s pricing overview, a 1024 × 1024 image costs just $0.006 in low quality, $0.053 in medium quality, and $0.211 in high quality. At larger resolutions such as 1024 × 1536, costs drop slightly to $0.005, $0.041, and $0.165 respectively.

| Model | Quality | 1024 × 1024 | 1024 × 1536 | 1536 × 1024 |
| --- | --- | --- | --- | --- |
| GPT Image 2 | Low | $0.006 | $0.005 | $0.005 |
| GPT Image 2 | Medium | $0.053 | $0.041 | $0.041 |
| GPT Image 2 | High | $0.211 | $0.165 | $0.165 |
| GPT Image 1.5 | Low | $0.009 | $0.013 | $0.013 |
| GPT Image 1.5 | Medium | $0.034 | $0.05 | $0.05 |
| GPT Image 1.5 | High | $0.133 | $0.20 | $0.20 |

At larger formats, GPT Image 2 is cheaper than its predecessor: 1024 × 1536 in high quality costs $0.165 instead of $0.20 with GPT Image 1.5. At the standard 1024 × 1024 resolution in high quality, however, the new model is more expensive at $0.211 compared with GPT Image 1.5 at $0.133. API outputs above 2K are still in beta and may produce inconsistent results.
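The crossover between the two models can be checked mechanically. This sketch encodes the per-image prices quoted in this article in a lookup table and returns the cheaper model for a given quality and resolution:

```python
# Per-image prices (USD) as quoted in this article's pricing table.
PRICES = {
    "gpt-image-2": {
        ("low", "1024x1024"): 0.006, ("low", "1024x1536"): 0.005, ("low", "1536x1024"): 0.005,
        ("medium", "1024x1024"): 0.053, ("medium", "1024x1536"): 0.041, ("medium", "1536x1024"): 0.041,
        ("high", "1024x1024"): 0.211, ("high", "1024x1536"): 0.165, ("high", "1536x1024"): 0.165,
    },
    "gpt-image-1.5": {
        ("low", "1024x1024"): 0.009, ("low", "1024x1536"): 0.013, ("low", "1536x1024"): 0.013,
        ("medium", "1024x1024"): 0.034, ("medium", "1024x1536"): 0.05, ("medium", "1536x1024"): 0.05,
        ("high", "1024x1024"): 0.133, ("high", "1024x1536"): 0.2, ("high", "1536x1024"): 0.2,
    },
}

def cheaper_model(quality, size):
    """Return (model, price) of the cheaper option for a quality/size combination."""
    return min(
        ((model, table[(quality, size)]) for model, table in PRICES.items()),
        key=lambda pair: pair[1],
    )

print(cheaper_model("high", "1024x1536"))  # the new model wins at the larger format
print(cheaper_model("high", "1024x1024"))  # the older model wins at 1024 square
```

In other words, whether GPT Image 2 saves money depends entirely on the output format: it undercuts GPT Image 1.5 at the larger resolutions but costs more at 1024 × 1024.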

OpenAI lists localized advertising, infographics, educational content, design tools, and creative platforms as use cases. In Codex, image generation is expected to be available directly inside the workspace without a separate API key.

In our own benchmark prompt, ChatGPT Images 2.0 performs exceptionally well. Both the standard and Thinking versions handle the complex, abstract prompt with high fidelity to its details.

A hyper-realistic DSLR photo. A monkey holding a pink banana is sitting on a tiger in the foreground. In the background, a HORSE is RIDING AN ASTRONAUT. The astronaut is underneath like a living “spacesuit horse saddle,” and the HORSE is clearly on top, in control, as the rider. Make it 100% unambiguous: the HORSE is the rider and the ASTRONAUT is being ridden, NOT the other way around. High-resolution, sharp focus, realistic lighting.

The instant model has a slightly artificial look, while the Thinking version delivers the DSLR requirement much more convincingly.

Standard version

Thinking version

Bonus prompt: Turn this article into a magazine excerpt. Create an image of a two-page spread from an 80s-style BYTE-like magazine lying on a table. This is the text in the magazine. Add fitting images in between that supposedly show GPT Image 2-generated pictures with impressive detail and realism, but which in reality look the way people in the 1980s would have imagined something astonishing for a neural network — really awful quality, more like GANs or at best DALL·E 1. The magazine is called “THE DECODER.”

OpenAI’s new image model will be released soon. The model, which has been circulating for some time under the codename “gpt-image-2,” is already being tested by some ChatGPT users and on public leaderboards. In recent weeks, the first images have appeared on platforms like X and Reddit that are barely distinguishable from real photographs. So far, only testers in the U.S. or those with U.S.-based accounts seem to have gained access to the model.

Example of a fake photo generated with Image 2: Microsoft’s CEO proudly presents that the Google Chrome browser is most often downloaded via Edge. | Image: via X

The new model is expected to perform especially well on complex images and diagrams containing text. For example, it is said to be able to generate detailed screenshots. Accordingly, the model could also be useful for advertising and educational content such as infographics, since it renders text more reliably.


OpenAI announces the livestream for the new image model with a generated screenshot. | Image: OpenAI

The typical “AI look” with perfect lighting and smooth faces, a problem that still affected GPT Image 1.5, is also supposed to be fixed. Until now, Google’s Nano Banana Pro clearly had the upper hand here. OpenAI is officially unveiling its new image model tonight in a livestream starting at 9 p.m. German time.

GPT Image 2 looks like a meaningful step forward because OpenAI is combining higher visual fidelity with reasoning, web-aware generation, and better consistency across multiple outputs. The most important practical improvements are not just prettier images, but stronger handling of text, layouts, UI-like compositions, and production-oriented formats that make the model more useful for real creative and commercial workflows.