News

OpenAI introduces image generation for ChatGPT users powered by GPT-4o

OpenAI has integrated picture generation into ChatGPT, allowing users to use GPT-4o to create photos within the chatbot. This feature, available to subscribers with different subscription tiers, represents a significant step towards AI-driven communication, offering accuracy, adaptability, and interactiveness. Developers will soon have access to the API.

OpenAI introduces the o3 and o4-mini AI reasoning models.

OpenAI has announced that it has integrated picture generation directly into ChatGPT. This means that users can use GPT-4o to create photos inside the chatbot.

Previously, users had to utilize DALL-E to create photos, either in ChatGPT or on an external platform. Now, OpenAI has used GPT-4o to directly integrate an even more sophisticated image-generating capability into ChatGPT.

The feature was made available to subscribers with Plus, Pro, Team, and Free subscription tiers on Tuesday, March 25.

Also Read: OpenAI intends to bill up to $20,000 a month for specialized AI “agents.”

The capability is a major step toward making image creation a crucial component of AI-driven communication, the business revealed in a press release.

The CEO of OpenAI, Sam Altman, called it “an incredible technology/product” on his X page. He emphasized that the function represents a new level of creative freedom, but he acknowledged that while people will produce amazing stuff, some outputs may upset people.

New features in the picture generation of GPT-4o

The image production in GPT-4o is intended to be more accurate, adaptable, and interactive than in earlier versions. According to OpenAI, the model is excellent in several crucial areas:

Text Rendering: GPT-4o produces accurate and understandable text, which makes it appropriate for diagrams, infographics, and annotated visuals, in contrast to earlier AI models that had trouble enclosing readable text within images.
Multi-turn Generation: By using dialogue to refine images, users can make gradual changes to visuals while preserving consistency between iterations. Storyboarding, branding, and character design are among the jobs that benefit from this capability.

Also Read: OpenAI adds ChatGPT to WhatsApp

Instructions to Follow: More accurate handling of complex cues allows GPT-4o to produce images with up to 10–20 different objects while preserving their relationships and characteristics.
In-Context Learning: The AI is helpful for visual brainstorming and design inspiration since it can examine uploaded photos and incorporate details into new projects.
Knowledge Integration: The model creates context-aware visuals like weather infographics, technical diagrams, and instructional illustrations by connecting its comprehension of text and images.

Also Read: OpenAI releases GPT-4.5: the “largest, most knowledgeable” model

Additionally, OpenAI disclosed that in the upcoming weeks, developers would have access to the API, allowing for a more thorough integration of GPT-4o’s picture capabilities across apps.

With plans to expand to Enterprise and Education members, the picture-generating feature is now available to ChatGPT users.

With the ability to specify colors, aspect ratios, and other design features, users can create graphics in ChatGPT by just stating their needs.

OpenAI stated that it could take up to a minute to render photos because of the model’s complexity.

Also Read: OpenAI Intends to Open Data Centers in India

OpenAI affirmed that DALL·E would remain accessible as a different model option for developers and companies seeking greater customization.

This post was last modified on March 28, 2025 11:09 pm

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.