Explore Alibaba's latest AI video generator, EMO, as it takes on OpenAI's Sora. From realistic singing performances to emotive facial animations, discover how these advanced systems are shaping the future of AI-driven video creation.
Alibaba EMO AI video generator
Alibaba’s Institute for Intelligent Computing has introduced EMO, a cutting-edge AI video generator poised to compete with OpenAI’s Sora. EMO showcases a remarkable ability to turn still images into lifelike actors and charismatic singers. In contrast to traditional artificial intelligence face-swapping techniques, EMO goes beyond mere mimicry, infusing emotions and expressions into its creations.
What Is OpenAI Sora? How To Use It?
The demos showcased on GitHub highlight EMO’s ability to make famous personalities like the Sora lady, known for wandering through AI-generated Tokyo, sing “Don’t Start Now” by Dua Lipa. The video illustrates EMO’s capacity to make static images speak and emote convincingly.
Notably, EMO differs from traditional AI face-swapping, providing more nuanced facial animations. It leverages a reference-attention mechanism and a separate audio-attention mechanism, using a vast dataset of audio and video for realistic remote expressions. The demo goes beyond lip movement, capturing subtle facial details between phrases, mirroring genuine human emotion.
Comparisons to other AI face animation frameworks like NVIDIA’s Audio 2Face highlight EMO’s superiority. While Audio 2Face relies on 3D animation, EMO generates photorealistic video, demonstrating a remarkable advancement in facial animation technology.
Also Read: NVIDIA Launches ‘Chat with RTX’: Local AI Chatbot Harnessing Personal Data
However, it’s crucial to acknowledge that the assessment is based on a demo, and practical application may require trial and error. The characters in the demos exhibit moderate emotions, leaving questions about EMO’s capability to handle extreme emotional expressions solely through audio cues.
Alibaba’s EMO and OpenAI’s Sora represent the forefront of AI video generation, pushing the boundaries of what’s possible in digital animation. While their capabilities are awe-inspiring, the implications for the future of entertainment and beyond are yet to be fully realized.
This post was last modified on March 1, 2024 8:14 am
Google is launching The Android Show: I/O Edition, featuring Android ecosystem president Sameer Samat, to…
The top 11 generative AI companies in the world are listed below. These companies have…
Google has integrated Veo 2 video generation into the Gemini app for Advanced subscribers, enabling…
Perplexity's iOS app now makes its conversational AI voice assistant compatible with Apple devices, enabling…
Bhavish Aggarwal is in talks to raise $300 million for his AI company, Krutrim AI…
The Beijing Humanoid Robot Innovation Center won the Yizhuang Half-Marathon with the "Tiangong Ultra," a…