News

OpenAI introduces image generation for ChatGPT users powered by GPT-4o

OpenAI has integrated picture generation into ChatGPT, allowing users to use GPT-4o to create photos within the chatbot. This feature, available to subscribers with different subscription tiers, represents a significant step towards AI-driven communication, offering accuracy, adaptability, and interactiveness. Developers will soon have access to the API.

OpenAI has announced that it has integrated picture generation directly into ChatGPT. This means that users can use GPT-4o to create photos inside the chatbot.

Previously, users had to utilize DALL-E to create photos, either in ChatGPT or on an external platform. Now, OpenAI has used GPT-4o to directly integrate an even more sophisticated image-generating capability into ChatGPT.

The feature was made available to subscribers with Plus, Pro, Team, and Free subscription tiers on Tuesday, March 25.

Also Read: OpenAI intends to bill up to $20,000 a month for specialized AI “agents.”

The capability is a major step toward making image creation a crucial component of AI-driven communication, the business revealed in a press release.

The CEO of OpenAI, Sam Altman, called it “an incredible technology/product” on his X page. He emphasized that the function represents a new level of creative freedom, but he acknowledged that while people will produce amazing stuff, some outputs may upset people.

New features in the picture generation of GPT-4o

The image production in GPT-4o is intended to be more accurate, adaptable, and interactive than in earlier versions. According to OpenAI, the model is excellent in several crucial areas:

  • Text Rendering: GPT-4o produces accurate and understandable text, which makes it appropriate for diagrams, infographics, and annotated visuals, in contrast to earlier AI models that had trouble enclosing readable text within images.
  • Multi-turn Generation: By using dialogue to refine images, users can make gradual changes to visuals while preserving consistency between iterations. Storyboarding, branding, and character design are among the jobs that benefit from this capability.

Also Read: OpenAI adds ChatGPT to WhatsApp

  • Instructions to Follow: More accurate handling of complex cues allows GPT-4o to produce images with up to 10–20 different objects while preserving their relationships and characteristics.
  • In-Context Learning: The AI is helpful for visual brainstorming and design inspiration since it can examine uploaded photos and incorporate details into new projects.
  • Knowledge Integration: The model creates context-aware visuals like weather infographics, technical diagrams, and instructional illustrations by connecting its comprehension of text and images.

Also Read: OpenAI releases GPT-4.5: the “largest, most knowledgeable” model

Additionally, OpenAI disclosed that in the upcoming weeks, developers would have access to the API, allowing for a more thorough integration of GPT-4o’s picture capabilities across apps.

With plans to expand to Enterprise and Education members, the picture-generating feature is now available to ChatGPT users.

With the ability to specify colors, aspect ratios, and other design features, users can create graphics in ChatGPT by just stating their needs.

OpenAI stated that it could take up to a minute to render photos because of the model’s complexity.

Also Read: OpenAI Intends to Open Data Centers in India

OpenAI affirmed that DALL·E would remain accessible as a different model option for developers and companies seeking greater customization.

This post was last modified on March 28, 2025 11:09 pm

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Recent Posts

Veo 2 extends access to more Gemini Advanced Users

Google has integrated Veo 2 video generation into the Gemini app for Advanced subscribers, enabling…

April 25, 2025

Perplexity launches the iPhone voice assistant

Perplexity's iOS app now makes its conversational AI voice assistant compatible with Apple devices, enabling…

April 24, 2025

Ola’s AI arm Krutrim intends to raise $300 million

Bhavish Aggarwal is in talks to raise $300 million for his AI company, Krutrim AI…

April 22, 2025

World’s first humanoid half-marathon pits people against robots

The Beijing Humanoid Robot Innovation Center won the Yizhuang Half-Marathon with the "Tiangong Ultra," a…

April 22, 2025

Cursor AI Code Editor: How to Use, Features, Pricing and Other Details Here

Cursor AI Code Editor is more than just a coding tool; it’s a comprehensive assistant…

April 22, 2025

Ray-Ban Meta AI Smart Glasses: Features, Types and FAQs

Ray-Ban Meta AI Smart Glasses are revolutionizing wearable tech with cutting-edge features like a 12…

April 22, 2025