'Not just generating images. It’s thinking' — Chat GPT Images 2.0 could fundamentally change how you make AI images | Tech Radar
Overview
News, deals, reviews, guides and more on the newest computing gadgets
Start exploring exclusive deals, expert advice and more
Details
Unlock and manage exclusive Techradar member rewards.
'Not just generating images. It’s thinking' — Chat GPT Images 2.0 could fundamentally change how you make AI images
Open AI’s new model focuses on better interpretation of complex image prompts.
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works.
Unlock instant access to exclusive member features.
Get full access to premium articles, exclusive features and a growing list of member rewards.
The new AI image model improves on its predecessor with more accurate, structured, and consistent visuals
The update adds a reasoning step that helps the system better interpret complex prompts and brings Chat GPT closer to Gemini’s multimodal strengths
Open AI has released a major update to Chat GPT's image generator. The company claims the new Chat GPT Images 2.0 is a shift in how the AI chatbot handles visual requests, moving from quick interpretation to something closer to deliberate construction. Open AI CEO Sam Altman and his team, in a livestream announcement, pointed to how the images now behave more like answers, built from an understanding of what you asked rather than a loose approximation of it.
"Images 2.0 is a huge step forward," Altman said. "It's like going from GPT-3 to GPT-5 all at once. Its ability to make extremely beautiful things is remarkable. The team really cooked with this one, and we can't wait to see what you'll do with it."
The most immediate improvement shows up in places that used to break down. Text inside images is the obvious example. Posters, menus, slides, and anything that relies on words being legible has traditionally been unreliable. Letters would warp, spacing would drift, and meaning would get lost.
Open AI introduces Chat GPT 5.4 Thinking for solving bigger problems
I upgraded my AI image prompts using Gemini’s advice: it changed everything
Chat GPT’s backup model just got smarter — as Open AI adds a new Pro option
It also handles structure more confidently. If you ask for a layout with specific elements in specific places, the result is more likely to reflect that intent. The model appears to treat the prompt less like a suggestion and more like a set of instructions.
This shows up in smaller ways as well. Multiple images generated from the same idea tend to stay visually consistent, whether that means keeping a character recognizable or maintaining a shared style across a set.
The bigger change is the reasoning step Chat GPT Images 2.0 adds before generation, allowing the model to work through a prompt before committing to a final output.
In practice, this means it can break a request into parts, decide how those parts should fit together, and then produce an image that reflects that internal plan. It can also draw on additional context like uploaded files or other sources online. That means it takes a little longer to get the image, but it makes for a better result and presumably will save you time by not requiring repeated attempts.
This is where image generation starts to resemble the behavior of advanced text models. The process is no longer purely reactive. It is interpretive. The output reflects a sequence of decisions rather than a single pass.
That shift matters most when the request has multiple layers. A multi-part design or a narrative sequence benefits from the system’s ability to hold those pieces together.
As the competition in multimodal AI heats up, Open AI can now point to Chat GPT Images 2.0 as a stronger rival to Google Gemini. Gemini has focused heavily on connecting text, images, and context into a single system, connecting across digital ecosystems. It often looked better than Chat GPT's images in that contest. But Chat GPT Images 2.0 narrows that gap.
Nano Banana 2 shows off its powerful image creation enhancements
Open AI introduces new Chat GPT 5.3 Instant to ‘reduce the cringe’ for users
Better reasoning, notably with text, means Chat GPT can muscle in on Gemini’s strengths in structured, multimodal tasks. It doesn't make Chat GPT a clear winner, but it does put it closer to parity in more ways.
Text models have already set a standard for fluid, context-aware responses. Bringing that same kind of reasoning into image generation starts to unify the experience. Whether you are writing something or visualizing it, the system is working from the same underlying understanding. That's where tools like Chat GPT and Gemini are clearly heading, and this update feels like a step that makes that convergence tangible.
Ultimately, a reduction in friction and improvement in images is what most users care about. If Chat GPT Images 2.0 can stand out as the best option, Google might have more trouble enticing users to migrate or stay in its own AI bubble.
Follow Tech Radar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!
And of course you can also follow Tech Radar on Tik Tok for news, reviews, unboxings in video form, and get regular updates from us on Whats App too.
➡️ Read our full guide to the best business laptops
- Best overall: Dell Precision 5690
- Best on a budget: Acer Aspire 5
- Best Mac Book: Apple Mac Book Pro 14-inch (M4)
Eric Hal Schwartz is a freelance writer for Tech Radar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as Open AI’s Chat GPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
You must confirm your public display name before commenting
1'Pushpaganda is, at the highest level, a case of social engineering': Experts warn scammers are flooding Google Discover with AI-generated content spreading malicious notifications
2'The evidence is starting to mount': physicists at the LHC have found a possible 'anomaly' that could unlock 'a new understanding of how the universe works' — and 'charming penguins' may hold the key to whether the Standard Model is out of date
3I tried my best not to love Dali's entry-level bookshelf speakers straight away, and my outright failure proves just how good they are
4'This is not goodbye' : Tim Cook makes it clear he's not walking away from Apple
5 Iran alleges systematic sabotage of US-made networking infrastructure mid-conflict — hardware shut down and rebooted despite internet blackout
Tech Radar is part of Future US Inc, an international media group and leading digital publisher. Visit our corporate site.
© Future US, Inc. Full 7th Floor, 130 West 42nd Street, New York, NY 10036.
Key Takeaways
-
News, deals, reviews, guides and more on the newest computing gadgets
-
Start exploring exclusive deals, expert advice and more
-
Unlock and manage exclusive Techradar member rewards
-
'Not just generating images
-
Open AI’s new model focuses on better interpretation of complex image prompts



