OpenAI has announced that it has integrated image generation directly into ChatGPT, so users can create images with GPT-4o inside the chatbot.
Previously, users had to rely on DALL·E to create images, either in ChatGPT or on an external platform. Now, OpenAI has built a more sophisticated image-generation capability directly into ChatGPT using GPT-4o.
The feature was made available to users on the Plus, Pro, Team, and Free tiers on Tuesday, March 25.
The capability is a major step toward making image creation a core component of AI-driven communication, the company said in a press release.
The CEO of OpenAI, Sam Altman, called it “an incredible technology/product” on his X account. He emphasized that the feature represents a new level of creative freedom, while acknowledging that, although people will create amazing things, some outputs may offend people.
New features in GPT-4o’s image generation
Image generation in GPT-4o is intended to be more accurate, adaptable, and interactive than in earlier versions. According to OpenAI, the model excels in several key areas:
- Text Rendering: Unlike earlier AI models, which struggled to render readable text within images, GPT-4o produces accurate, legible text, making it suitable for diagrams, infographics, and annotated visuals.
- Multi-turn Generation: Users can refine images through conversation, making gradual changes while preserving consistency between iterations. Tasks such as storyboarding, branding, and character design benefit from this capability.
- Instruction Following: More accurate handling of complex prompts allows GPT-4o to produce images with up to 10–20 different objects while preserving their relationships and attributes.
- In-Context Learning: The model can analyze uploaded images and incorporate their details into new ones, which makes it useful for visual brainstorming and design inspiration.
- Knowledge Integration: By connecting its understanding of text and images, the model creates context-aware visuals such as weather infographics, technical diagrams, and instructional illustrations.
Additionally, OpenAI said that developers would gain API access in the coming weeks, allowing deeper integration of GPT-4o’s image capabilities into apps.
The image-generation feature is now available to ChatGPT users, with plans to expand to Enterprise and Education users.
Users can create images in ChatGPT simply by describing what they need, and can specify colors, aspect ratios, and other design details.
OpenAI stated that images could take up to a minute to render because of the model’s complexity.
OpenAI confirmed that DALL·E would remain available as a separate model option for developers and companies seeking greater customization.