Open and Advanced Large-Scale Video Generative Models.
OmniHuman is an end-to-end AI framework developed by ByteDance that generates realistic human videos from a single image and an audio input, with lip sync, facial animation, and gesture synthesis. Given a portrait, half-body, or full-body photo, OmniHuman animates it with natural movements, expressive gestures, and lip movements synchronized to the audio. The model supports real human portraits as well as animated or cartoon characters, making it suitable for content creation, singing, lip sync videos, and performance scenarios. Pricing: $0.12 per second.
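For budgeting, the per-second price can be turned into a rough estimate. The sketch below is only an illustration: it assumes the $0.12 rate is billed per second of generated video (the listing does not say what the second refers to), and `estimate_cost` is a hypothetical helper, not part of any OmniHuman API.

```python
from decimal import Decimal

# Rate taken from the listing; assumed here to apply per second of output video.
PRICE_PER_SECOND_USD = Decimal("0.12")


def estimate_cost(duration_seconds):
    """Return the estimated cost in USD, rounded to cents (hypothetical helper)."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Convert via str() so float inputs such as 7.5 do not carry binary rounding noise.
    seconds = Decimal(str(duration_seconds))
    return (seconds * PRICE_PER_SECOND_USD).quantize(Decimal("0.01"))


if __name__ == "__main__":
    for seconds in (5, 30, 60):
        print(f"{seconds:>3} s -> ${estimate_cost(seconds)}")
```

Under that assumption, a 30-second clip would come to $3.60 and a one-minute clip to $7.20. Actual billing may round durations or apply minimums that this sketch does not model.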