News

Stability AI’s Stable Video 4D: Cutting-Edge 3D Video Synthesis from Single Video Input

Explore Stable Video 4D by Stability AI. This innovative model creates dynamic 3D videos from a single video input, offering 8 novel-view angles. Ideal for virtual reality, film editing, and game development. Available on Hugging Face.

Stability AI is pleased to present Stable Video 4D. This cutting-edge model offers a new degree of adaptability and creativity by enabling users to upload a single video and receive dynamic novel-view films from eight different angles.

As per the official blog, using a video as input, the Stable Video 4D model creates several novel-view videos from various viewpoints, building upon the strong foundation of our Stable Video Diffusion model, which turns images into videos. With this development, you can create full 3D dynamic video synthesis instead of only image-based videos, which is a significant improvement.

Also Read: Stability AI Launches Free Text-to-Sound Generator

With more growth anticipated, the model—which is presently in its research phase—may find use in virtual reality, film editing, and game development in the future. Hugging Face is now selling it. 

How It Operates

To begin, users upload a single movie and choose the 3D camera poses they want. After that, Stable Video 4D creates eight novel-view videos that follow the designated camera perspectives, giving viewers a thorough, multi-angle picture of the subject. Then, a dynamic 3D depiction of the video’s subject can be effectively optimized using the created videos.

At the moment, Stable Video 4D can produce 5-frame videos for each of the eight viewpoints in around 40 seconds, while the full 4D optimization takes 20 to 25 minutes. Researchers at Stability AI are looking forward to using virtual reality, video editing, and game development in the future. The capacity to view items from numerous angles can greatly aid professionals in several fields by improving the realism and immersion of their goods.

Also Read: Why Stability AI CEO Emad Mostaque Resigned

Cutting Edge Performance

As opposed to earlier methods, which frequently call for sampling from a mix of image diffusion models, video diffusion models, and multi-view diffusion models, SV4D may produce many novel-view films simultaneously, significantly enhancing the consistency in the temporal and spatial axes. This capability allows for a more lightweight 4D optimization framework without the laborious score distillation sampling (SDS) with various diffusion models, while also guaranteeing constant object display across multiple views and timestamps.

Investigation and Creation

A significant milestone for Stability AI has been reached with the release of Stable Video 4D, the first video-to-video generation model, on Hugging Face. Stability AI current focus is on improving the model’s performance so that it can process films from real-world sources in addition to the artificial datasets used for training.

Also Read: Stability AI Introducing Stable Video Diffusion: A Leap in Generative Video AI

The Stability AI team is committed to ongoing innovation and investigating applications of this and other technologies in practical settings. They expect businesses to use their approach and refine it even more to meet their own needs. They are eager to watch how this technology develops further as research and development continue since it has enormous promise for producing realistic, multi-angle videos.

Alongside this announcement, Stability AI is also publishing an extensive technical study that outlines the approaches used, difficulties encountered, and innovations made throughout the model’s development. 

Also Read: Meta unveils its largest ‘open’ AI Model to date

This post was last modified on July 25, 2024 5:59 am

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Recent Posts

Google is moving Android news to a virtual event before I/O

Google is launching The Android Show: I/O Edition, featuring Android ecosystem president Sameer Samat, to…

April 29, 2025

Top Generative AI Companies of the World 2025

The top 11 generative AI companies in the world are listed below. These companies have…

April 28, 2025

Veo 2 extends access to more Gemini Advanced Users

Google has integrated Veo 2 video generation into the Gemini app for Advanced subscribers, enabling…

April 25, 2025

Perplexity launches the iPhone voice assistant

Perplexity's iOS app now makes its conversational AI voice assistant compatible with Apple devices, enabling…

April 24, 2025

Ola’s AI arm Krutrim intends to raise $300 million

Bhavish Aggarwal is in talks to raise $300 million for his AI company, Krutrim AI…

April 22, 2025

World’s first humanoid half-marathon pits people against robots

The Beijing Humanoid Robot Innovation Center won the Yizhuang Half-Marathon with the "Tiangong Ultra," a…

April 22, 2025