News

OpenAI Sora: Create Realistic and Imaginative Scenes From Texts

‘Sora’ – OpenAI new video-generation model to be launched soon. According to OpenAI’s introductory blog post, Sora “can create realistic and imaginative scenes from text instructions.” The text-to-video model allows users to create photorealistic videos up to a minute long, all based on prompts they’ve written.

OpenAI’s new video-generation tool, ‘Sora’ will add another realm for Artificial Intelligence (AI) based video creation. With a few simple prompts, ‘Sora’ can create realistic and imaginative scenes. 

OpenAI Sora can create “complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” according to the official release. The Sora  model can understand how objects “exist in the physical world,” and “accurately interpret props and generate compelling characters that express vibrant emotions.”

How OpenAI Sora is different from Synthesia, Runway, Deepbrain AI and InVideo

Sora can generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.

Sora’s model has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions. Sora can also create multiple shots within a single generated video that accurately portray characters and their visual style.

Sora builds on past research in DALL·E and GPT models. It uses the recaptioning technique from DALL·E 3, which involves generating highly descriptive captions for the visual training data. As a result, the model can follow the user’s text instructions in the generated video more faithfully.

Must Read: Google Gemini vs OpenAI’s ChatGPT: A Battle of AI Titans Compared

To generate a video solely from text instructions, the model can take an existing still image and generate a video from it, animating the image’s contents with accuracy and attention to small details based on the prompt given. The model can also take an existing video and extend it or fill in missing frames. 

Sora text-to-video model is a diffusion that generates a video by starting with one that looks like static noise and gradually transforming it by removing the noise over many steps. Sora is capable of generating entire videos all at once or extending generated videos to make them longer. 

Text-to-video videos and images generated by Sora work as collections of smaller units of data called patches, each of which is akin to a token in GPT. By unifying the represented data, the Sora model is trained to diffuse transformers on a wider range of visual data spanning durations, resolutions, and aspect ratios.

Also Read: Microsoft-backed OpenAI Hits $2 Billion Revenue Mark; Eyes on Doubling by 2025

How Can I Use OpenAI Sora: Text-to-video Video Model

According to the OpenAI official details, Sora is becoming available to red teamers to assess critical areas for harm or risks. As per the official blog on OpenAI Sora, model access has been given to a number of visual artists, designers, and filmmakers to gain feedback on how to advance the model to be most helpful for creative professionals.

The current model has weaknesses. It may struggle with accurately simulating the physics of a complex scene and may not understand specific instances of cause and effect. For example, a person may take a bite out of a cookie, but afterward, the cookie may not have a bite mark.

Also Read: OpenAI’s Next Venture: Device Operation Automation with Advanced Agents

OpenAI Sora Generated Videos: Watch Here

SUV In The Dust

Art Museum

Grandma Birthday

Françoise

Francoise Hardy, A digital content creator and tech integration specialist with over 10 years of experience, is known for his deep knowledge in AI, ML, Data Science, Robotics, and Neural Networks. He began his career with a passion for emerging technologies, leading to innovative solutions and digital transformation in various businesses. Francoise's expertise extends to the ethical aspects of technology, advocating for responsible usage. Recognized by his peers, he is a sought-after speaker and writer in the tech industry. His commitment to advancing technology for societal benefit defines his career.

Recent Posts

AI ‘Godfather’ Geoffrey Hinton Advocates for Universal Basic Income Amid AI Advancements

AI pioneer Geoffrey Hinton warns of job losses and inequality due to AI, urging governments…

9 hours ago

What is Retrieval-Augmented Generation (RAG)?

Learn how RAG enhances the accuracy and relevance of generated content by dynamically integrating specific…

11 hours ago

How Does Bitcoin Mining Work?

Discover the process of Bitcoin mining, where transactions are verified and added to the blockchain,…

11 hours ago

Brain Teaser Challenge: Find the mistake in the kids playing picture in 9 seconds!

Can you find the mistake in the kids playing picture in 9 seconds? Test your…

13 hours ago

New Neuronal Structures Discovered Through Google Brain Mapping

Google scientists mapped a cubic millimetre of human brain tissue at nanoscale resolution, uncovering new…

16 hours ago

Meet the Young Indian Behind OpenAI’s GPT-4o Innovation

At OpenAI, Prafulla Dhariwal is in charge of the Omni team, and GPT-4o represents their…

16 hours ago